perf(runtime): reduce timing bias by reordering timestamps by rocketman-code · Pull Request #112 · rocketman-code/piano

rocketman-code · 2026-02-25T06:23:45Z

Summary

Add calibration harness with busy-wait reference function that provides ground-truth durations from the same Instant clock Piano uses
Add bias measurement test that quantifies Piano's per-call timing error at multiple durations (100us, 10us, 1us, 100ns)
Reorder enter() so Instant::now() is captured after all bookkeeping (epoch, thread ID, alloc save, stack push)
Reorder Guard::drop() so Instant::now() is captured before all bookkeeping (thread ID check, alloc read, stack pop, alloc restore)

Median bias drops from ~166ns to ~42ns per call -- a 75% reduction. The residual ~42ns is the irreducible cost of two clock reads. Existing ratio accuracy tests still pass.
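The reordering can be illustrated with a minimal sketch. This is not Piano's actual code: the `STACK` and `RECORDS` thread-locals and the `take_records` helper are hypothetical stand-ins for the real bookkeeping (epoch, thread ID, alloc save, stack push). The point is only the placement of `Instant::now()`: last in `enter()`, first in `drop()`, so bookkeeping cost falls outside the measured interval.

```rust
use std::cell::RefCell;
use std::time::{Duration, Instant};

thread_local! {
    // Hypothetical per-thread call stack; stands in for the real bookkeeping.
    static STACK: RefCell<Vec<&'static str>> = RefCell::new(Vec::new());
    // Hypothetical sink for finished measurements.
    static RECORDS: RefCell<Vec<(&'static str, Duration)>> = RefCell::new(Vec::new());
}

pub struct Guard {
    start: Instant,
}

pub fn enter(name: &'static str) -> Guard {
    // Bookkeeping first, so its cost is not attributed to the function body.
    STACK.with(|s| s.borrow_mut().push(name));
    // Timestamp last: the measured interval starts as late as possible.
    Guard { start: Instant::now() }
}

impl Drop for Guard {
    fn drop(&mut self) {
        // Timestamp first: the measured interval ends as early as possible.
        let end = Instant::now();
        let elapsed = end.duration_since(self.start);
        // Bookkeeping afterwards, outside the measured interval.
        let name = STACK.with(|s| s.borrow_mut().pop()).unwrap_or("<unknown>");
        RECORDS.with(|r| r.borrow_mut().push((name, elapsed)));
    }
}

/// Drains the records collected on the current thread (hypothetical helper).
pub fn take_records() -> Vec<(&'static str, Duration)> {
    RECORDS.with(|r| std::mem::take(&mut *r.borrow_mut()))
}
```

With this ordering, the only overhead left inside the interval is the pair of clock reads themselves, which matches the residual described above.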

Test plan

cargo test --workspace passes
cargo clippy --workspace --all-targets -- -D warnings clean
Calibration harness self-validates reference function (inner/outer agree within 200ns)
Bias measurement confirms reduction: ~166ns baseline -> ~42ns after reorder
Existing accuracy suite (ratio tests) still passes
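A rough sketch of the self-validation idea, under the assumption that "inner" means the duration measured inside the busy-wait and "outer" means the duration measured around the call. The function names here are illustrative, not taken from the harness, and the 200ns threshold check is left to the caller because it depends on the machine.

```rust
use std::time::{Duration, Instant};

/// Spins until at least `target` has elapsed and returns the duration
/// observed inside the loop (the "inner" measurement).
pub fn busy_wait(target: Duration) -> Duration {
    let start = Instant::now();
    loop {
        let elapsed = start.elapsed();
        if elapsed >= target {
            return elapsed;
        }
        std::hint::spin_loop();
    }
}

/// Runs `busy_wait` once and returns `(inner, outer)`, where `outer` is
/// measured around the call with the same `Instant` clock.
pub fn measure_reference(target: Duration) -> (Duration, Duration) {
    let outer_start = Instant::now();
    let inner = busy_wait(target);
    let outer = outer_start.elapsed();
    (inner, outer)
}
```

A caller could then compare `outer - inner` against a tolerance such as 200ns to decide whether the reference function is trustworthy on the current machine.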

Shrink Guard from 56 bytes to 16 bytes (two registers) by replacing Instant + ThreadId + alloc snapshot with a raw TSC tick and a packed thread-cookie/depth word. The hot path (enter/drop) is now inlined with a single rdtsc/cntvct_el0 instruction; all bookkeeping is split into cold out-of-line functions. The new tsc module handles hardware counter reads, one-time calibration (~2ms spin), and tick-to-nanosecond conversion via a simplified ratio. MSRV bumped 1.56 -> 1.59 for core::arch::asm! (inline assembly stabilized in 1.59). Updated Cargo.toml, CI, docs, and the MSRV integration test accordingly. Adds a _test_internals feature to expose collect_invocations() for external integration tests (calibration harness).
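The 16-byte layout and the ratio-based conversion might look roughly like the following. Everything here is an assumption made for illustration: the field names, the 48/16-bit split between cookie and depth, and the `TickRatio` type are not taken from the tsc module. The hardware counter read is omitted because it is architecture-specific.

```rust
use std::mem::size_of;

/// Hypothetical two-word guard: a raw counter tick plus a packed word.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Guard {
    pub start_tick: u64,
    pub packed: u64,
}

/// Size of the sketch guard in bytes (two u64 fields).
pub const GUARD_SIZE: usize = size_of::<Guard>();

// Assumed split: low 16 bits hold the depth, high 48 bits hold the cookie.
const DEPTH_BITS: u32 = 16;

pub fn pack(cookie: u64, depth: u16) -> u64 {
    debug_assert!(cookie < (1u64 << 48), "cookie must fit in 48 bits");
    (cookie << DEPTH_BITS) | u64::from(depth)
}

pub fn unpack(packed: u64) -> (u64, u16) {
    (packed >> DEPTH_BITS, (packed & 0xFFFF) as u16)
}

fn gcd(mut a: u64, mut b: u64) -> u64 {
    while b != 0 {
        let t = a % b;
        a = b;
        b = t;
    }
    a
}

/// Nanoseconds-per-tick ratio reduced to lowest terms.
pub struct TickRatio {
    numer: u64,
    denom: u64,
}

impl TickRatio {
    /// Builds the ratio from one calibration window: `elapsed_ns` of wall
    /// time during which the counter advanced by `elapsed_ticks`.
    pub fn from_calibration(elapsed_ns: u64, elapsed_ticks: u64) -> Self {
        assert!(elapsed_ticks > 0, "calibration window saw no ticks");
        let g = gcd(elapsed_ns, elapsed_ticks);
        TickRatio { numer: elapsed_ns / g, denom: elapsed_ticks / g }
    }

    /// Converts a tick delta to nanoseconds, widening to u128 so the
    /// intermediate product cannot overflow.
    pub fn ticks_to_ns(&self, ticks: u64) -> u64 {
        (u128::from(ticks) * u128::from(self.numer) / u128::from(self.denom)) as u64
    }
}
```

For example, a calibration window of 2,000,000 ns that saw 6,000,000 ticks reduces to a ratio of 1/3, so a delta of 3,000 ticks converts to 1,000 ns.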

Three #[ignore] benchmarks for development use:
- reference_function_accuracy: validates busy_wait ground truth
- amortized_overhead: measures enter/drop cost per call over 1M iterations
- bias_empty_fn: measures reported time for empty function (pure bias)

Run with: cargo test -p piano-runtime --features _test_internals --test calibration -- --ignored --nocapture
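The two measurement styles named above (amortized cost over many iterations, and a median over per-call samples) can be sketched generically. These helpers are hypothetical and not part of the calibration test; the median picks the upper middle element for even-length input, which is an arbitrary choice made here.

```rust
use std::time::Instant;

/// Times `iters` calls of `f` as one block and returns the mean cost per
/// call in nanoseconds. The caller is responsible for making sure `f` has
/// an observable effect so it is not optimized away.
pub fn amortized_ns_per_call<F: FnMut()>(iters: u32, mut f: F) -> f64 {
    assert!(iters > 0, "need at least one iteration");
    let start = Instant::now();
    for _ in 0..iters {
        f();
    }
    start.elapsed().as_nanos() as f64 / f64::from(iters)
}

/// Returns the median of per-call samples in nanoseconds, or `None` for an
/// empty slice. Sorts the slice in place.
pub fn median_ns(samples: &mut [u64]) -> Option<u64> {
    if samples.is_empty() {
        return None;
    }
    samples.sort_unstable();
    Some(samples[samples.len() / 2])
}
```

The median is less sensitive than the mean to occasional outliers such as scheduler preemption, which is one plausible reason to report bias as a median.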

rocketman-code force-pushed the perf/calibration-bias branch 2 times, most recently from 8a8baeb to 8b177ad, on February 25, 2026 10:05

This was referenced Feb 25, 2026

fix: restore function names for migrated async guards #116

Open

docs: document rdtsc cross-core monotonicity limitation #118

Open

rocketman-code added 2 commits February 25, 2026 02:13

rocketman-code force-pushed the perf/calibration-bias branch from 8b177ad to 2db4be4 on February 25, 2026 10:14

rocketman-code merged commit 32351c6 into main Feb 25, 2026
5 checks passed

rocketman-code deleted the perf/calibration-bias branch February 25, 2026 10:16
