Run spec tests in parallel to reduce the execution time by stevenfontanella · Pull Request #8088 · WebAssembly/binaryen

stevenfontanella · 2025-12-03T21:38:16Z

Reduces runtime from ~15 minutes to 1.5 minutes on my machine

Before:

After:

Failed test example (stdout and stderr isn't ordered with the exception unfortunately, but it's easy to re-run the particular test):

Run tests through a thread pool with os.cpu_count() * 2 threads
- os.cpu_count() * 4 shows no benefit. os.cpu_count() or os.cpu_count() // 2 also show no regression in runtime, but might be worse for machines with less cores. There are currently 315 spec tests total for reference.
Prefixes round-trip file name tests with their test name to avoid clobbering the a.wasm / ab.wast files during tests
Add stdout and stderr params to functions that print so that lines can be captured by each thread and not interleaved
- Note that we pass stdout as the stderr param in practice so that they are interleaved, otherwise all stdout lines and stderr lines will be outputted together in each test.

tlively · 2025-12-04T05:14:28Z

The alpine builder has been running for over three hours now, so I think there's a problem here. Looking at the log, it looks much more verbose than before because the stdout from the wasm-shell commands is being printed. Maybe fixing that will make the builder go faster?

kripken

Nice work!

kripken

lgtm % open comments

stevenfontanella · 2025-12-05T00:08:59Z

Re:

The alpine builder has been running for over three hours now, so I think there's a problem here. Looking at the log, it looks much more verbose than before because the stdout from the wasm-shell commands is being printed. Maybe fixing that will make the builder go faster?

I changed support.run_command to not print the process's stdout and fixed the Alpine build (it's using a lower Python version and didn't have Queue.shutdown())

stevenfontanella · 2025-12-05T00:43:50Z

I think the CI problem is that this member isn't initialized in the default constructor of SmallVector since std::array is an aggregate type. I'm not sure why we see the breakage here and not in main or in other architectures. I'll try to re-run for now and send a PR to fix separately.

CI link

kripken · 2025-12-05T15:38:57Z

About the std::array initialization error, we do disable that warning a few lines above for some archs, and maybe we just need to disable it in more. gcc does seem to have many such false positives...

…8088) Reduces runtime from ~15 minutes to 1.5 minutes on my machine * Run tests through a thread pool with `os.cpu_count()` threads * `os.cpu_count() * 4` shows no benefit. `os.cpu_count() // 2` also shows no regression in runtime, but might be worse for machines with less cores. There are currently 315 spec tests total for reference. * Prefixes round-trip file name tests with their test name to avoid clobbering the a.wasm / ab.wast files during tests * Add stdout and stderr params to functions that print so that lines can be captured by each thread and not interleaved * Note that we pass stdout as the stderr param in practice so that they are interleaved, otherwise all stdout lines and stderr lines will be outputted together in each test.

sbc100

I just came here to say things change is awesome. Why didn't we do this earlier!

sbc100 · 2025-12-15T16:38:18Z

+# Hack to allow subprocess.Popen with stdout/stderr to StringIO, which doesn't have a fileno and doesn't work otherwise
+def _process_communicate(*args, **kwargs):
+    overwrite_stderr = "stderr" in kwargs and isinstance(kwargs["stderr"], io.StringIO)
+    overwrite_stdout = "stdout" in kwargs and isinstance(kwargs["stdout"], io.StringIO)


Ha, this looks very similar to the code we have in emscripten test suite for this same purpose: https://github.com/emscripten-core/emscripten/blob/cb4450380c27e9be81e7c239ec7af5005ad283cd/test/common.py#L1153-L1181

Great minds think alike (or use the same AI :)

stevenfontanella · 2025-12-15T17:46:38Z

Thank you Sam!

stevenfontanella force-pushed the multithread branch 3 times, most recently from 44c5834 to 62b613b Compare December 4, 2025 01:19

stevenfontanella changed the title ~~Add multithreading for spec tests~~ Run spec tests in parallel to reduce the execution time Dec 4, 2025

Add thread pool for spec tests

bb0d612

stevenfontanella force-pushed the multithread branch from 62b613b to bb0d612 Compare December 4, 2025 01:55

stevenfontanella marked this pull request as ready for review December 4, 2025 01:56

stevenfontanella requested review from kripken and tlively December 4, 2025 01:56

tlively reviewed Dec 4, 2025

View reviewed changes

Comment thread scripts/test/support.py Outdated

Comment thread check.py Outdated

Comment thread check.py Outdated

Comment thread check.py Outdated

kripken reviewed Dec 4, 2025

View reviewed changes

Comment thread check.py

Comment thread check.py Outdated

PR Fixes + try fix for lower Python version to fix Alpine CI

0295f43

kripken approved these changes Dec 4, 2025

View reviewed changes

Comment thread check.py Outdated

stevenfontanella added 2 commits December 4, 2025 23:55

PR fixes

8b6039f

Fix _process_communicate

659663c

Try fixing std::array initialization

97fe435

kripken reviewed Dec 5, 2025

View reviewed changes

Comment thread src/support/small_vector.h

Remove unrelated fix

30892d5

stevenfontanella requested a review from tlively December 5, 2025 19:17

stevenfontanella added 2 commits December 5, 2025 20:11

Merge branch 'main' into multithread

184f986

Use more precise base name logic

008194e

stevenfontanella enabled auto-merge (squash) December 5, 2025 20:40

stevenfontanella merged commit 28e849b into main Dec 5, 2025
17 checks passed

stevenfontanella deleted the multithread branch December 5, 2025 21:10

sbc100 mentioned this pull request Dec 15, 2025

[test] Remove test/spec/expected-output. NFC #7385

Closed

sbc100 reviewed Dec 15, 2025

View reviewed changes

stevenfontanella mentioned this pull request Dec 17, 2025

Simplify thread pool used in spec tests runner #8143

Merged

Conversation

stevenfontanella commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tlively commented Dec 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kripken left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

kripken left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

stevenfontanella commented Dec 5, 2025

Uh oh!

stevenfontanella commented Dec 5, 2025

Uh oh!

kripken commented Dec 5, 2025

Uh oh!

Uh oh!

Uh oh!

sbc100 left a comment

Choose a reason for hiding this comment

Uh oh!

sbc100 Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

stevenfontanella commented Dec 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

stevenfontanella commented Dec 3, 2025 •

edited

Loading