Bench 31132 saltedtxid#183
Open
andrewtoth wants to merge 60 commits into
Adds build configuration, benchmarking CI workflows, Python dependencies, plotting tools, and documentation for benchcoin. Co-authored-by: David Gumberg <davidzgumberg@gmail.com> Co-authored-by: Lőrinc <pap.lorinc@gmail.com>
- Fix empty chart: use get_chart_data() instead of to_dict() so JS
filters can match config strings ("450", "32000") instead of objects
- Capture machine specs on self-hosted runner during build job and pass
via --machine-specs flag to nightly append, instead of detecting on
the ubuntu-latest publish runner
Run LogParser + PlotGenerator from bench/analyze.py during artifact
copying to produce static PNG charts from debug.log files. This
pre-generates the same 11 chart types that were previously rendered
client-side via JavaScript.
Changes to report.py:
- Import HAS_MATPLOTLIB, LogParser, PlotGenerator from bench.analyze
- _copy_network_artifacts: generate plots after each debug.log with
"{network}-{name}" prefix (e.g. "450-uninstrumented-pr")
- _copy_artifacts: generate plots for single-directory mode, including
when input_dir == output_dir
- _prepare_graphs_data: add "plots" key with relative paths to PNGs
- generate(): reorder to copy artifacts before HTML rendering so
_prepare_graphs_data can find the generated plot files
Plot generation is guarded by HAS_MATPLOTLIB for graceful fallback
when matplotlib is unavailable.
The pr-report.html template previously included debug-log-charts.html, which fetched multi-hundred-MB debug.log.gz files in the browser, decompressed them with pako.js, parsed every line, and rendered 11 Plotly charts client-side. This made report pages unresponsive. Now that report.py pre-generates the charts as static PNGs:
- pr-report.html: replace the debug-log-charts.html include with an img loop over graph.plots, using loading="lazy"
- debug-log-charts.html: delete (344 lines of client-side JS)
- base.html: remove the pako.js and Plotly CDN scripts (both are independently included by pr-chart.html and nightly-chart.html via their own script tags)
The debug.log download link is preserved.
Rewrite to document the TOML config + matrix entry workflow, removing stale references to the old two-commit comparison CLI, --datadir requirement, profiles, and BENCH_DATADIR env var.
Debug logs were consuming 388MB on gh-pages. They are already uploaded as CI artifacts with 90-day retention during benchmark runs.
- Remove gzip compression and copying of debug logs in report generation
- Remove debug log extraction in the publish-results workflow
- Replace per-graph "Download debug.log" links with a single link to the CI run page where artifacts can be downloaded
- Keep matplotlib plot generation from debug logs (plots are still generated during the report phase; only the raw logs are no longer published)
The PR comment with result links was posted before GitHub Pages finished deploying, leading to broken links. Add a wait-for-pages job that polls for the pages-build-deployment run matching our exact gh-pages commit, then blocks until it completes.
FITRIM ioctl requires the filesystem mount point. Resolve it from the tmp_datadir path by walking up to the mount boundary.
Manual (workflow_dispatch) runs are now stored separately from scheduled nightly runs. Scheduled runs still dedup by (date, commit, dbcache) to handle retries. Manual runs always append, appearing as diamond markers on the chart alongside the nightly trend line. Also ruff format.
Manual (workflow_dispatch) runs no longer get a separate "(manual)" legend entry with diamond markers. They appear as regular points in the same series trace as scheduled runs.
Adds a separate benchmark job (benchmark-noav) that runs IBD with -assumevalid=0 to measure full script verification performance. Uses a dedicated TOML config with uninstrumented-only matrix, and prefixes artifacts with noav- so the publish workflow can handle them alongside existing runs.
Introduce CoinsViewOverlay::StartFetching, which maps all input prevouts of a block into a new m_inputs vector of InputToFetch elements and returns a ResetGuard; both the guard and the InputToFetch elements are lifetime-bound to the block. Introduce StopFetching to clear the m_inputs vector. CCoinsViewCache::Reset is made virtual and overridden in CoinsViewOverlay; StopFetching is called on Reset, so the InputToFetch objects cannot outlive the block. Introduce ProcessInput to fetch the UTXO of an individual input in m_inputs. Each caller fetches the input at m_input_head and increments it, so each call fetches the next input in the queue. FetchCoinFromBase serves coins from the m_inputs vector by scanning all inputs until it finds the one with the matching outpoint. This is designed deliberately so multiple threads can call ProcessInput independently. Co-authored-by: l0rinc <pap.lorinc@gmail.com> Co-authored-by: Hodlinator <172445034+hodlinator@users.noreply.github.com>
Inputs are accessed by ConnectBlock in the same order as they are created in StartFetching (except for BIP30 checks). We can use this ordering, together with the fact that CoinsViewOverlay caches coins accessed via FetchCoinFromBase, to skip scanning over previously accessed coins. Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Collapses a 32-byte Txid into a uint64_t, using 4 random uint64_ts. Used in place of a hash function as a performance improvement. Co-authored-by: Pieter Wuille <pieter@wuille.net>
This is a performance improvement, because we can skip checking on disk that the input does not exist. Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Prepares for ProcessInput to be called from multiple threads. This flag acts as a memory fence around InputToFetch::coin. There is no lock guarding reads and writes of the coin field. Instead we use the flag's release/acquire semantics to ensure that when the main thread reads the coin it will have happened after a worker thread has finished writing it. Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Prepares for ProcessInput to be called from multiple threads. ProcessInput reads from the base view, so for it to be safe to call in parallel on separate threads the base must not be mutated. Flush, Sync, and SetBackend can modify the base, so we override these to call StopFetching before delegating to the base class. Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Add a configuration option for the number of worker threads used for parallel UTXO input fetching during block connection. Default is 4 threads, max is 15, 0 disables parallel fetching.
Prepares for ProcessInput to be called from multiple threads. Introduce a ThreadPool shared pointer to CoinsViewOverlay. A pool managed externally can be passed in the constructor. A global thread pool is used in fuzz harnesses since iterations can happen faster than the OS can create and tear down thread pools. This can cause a memory leak when fuzzing. Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Leverages the thread pool to fetch inputs on multiple threads, while the overlay serves inputs on the main thread. This is a performance improvement over blocking the main thread to fetch inputs. Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Co-authored-by: l0rinc <pap.lorinc@gmail.com>
Co-authored-by: l0rinc <pap.lorinc@gmail.com> Co-authored-by: sedited <seb.kung@gmail.com>
Provides a worst-case upper bound on the number of inputs that can fit in a block so callers (e.g. parallel input prefetching) can pre-allocate stable storage and rule out reallocation of per-input state. Adapted from PR bitcoin#9938 (Lock-Free CheckQueue) which introduced the same constants under different names (MIN_TXIN_SERIALIZED_SIZE, MAX_TXINS_PER_BLOCK). Co-authored-by: Jeremy Rubin <jeremy.l.rubin@gmail.com> Made-with: Cursor
Variant of the previous commit that uses SaltedTxidHasher (SipHash-2-4) for bucket selection in m_txids instead of QuickHasher (sum-of-XORed-chunks). Equality is unchanged (full Txid) so semantics are identical; the only difference is hash strength vs hashing cost on the StartFetching hot path. Pushed for benchmarking against the QuickHasher variant. Made-with: Cursor
Benchmark Results: comparison to nightly master (median of last 7 runs).