fix: Make the server obey the amplification limit by larseggert · Pull Request #3552 · mozilla/neqo

larseggert · 2026-04-10T13:00:19Z

The server was validating the client's address — and dropping the amplification limit — upon receiving a retransmitted client Initial whose DCID matched the server's chosen source CID. RFC 9000 Section 8.1 designates Handshake packets as the validation mechanism; the CID-based path (a MAY) is weaker here because Initial keys are publicly derivable, so the SCID is visible to any on-path observer.

In practice, with MLKEM the ClientHello spans two Initial packets. If only the first reaches the server (as in the QNS amplificationlimit test with drops_to_server=2,3,4,5,6,7), the server sends PTO probes that exhaust most of the amplification budget. When the client retransmits the second fragment using the server's SCID as DCID, the old code validated the path and let the server send the full ~12 KB Handshake flight unrestricted — over 4× amplification.

Fix: remove the DCID condition from postprocess_packet; only Handshake packets validate the path during setup.

Test: anti_amplification_initial_retransmit_no_validate simulates the exact scenario — first fragment only, server ACKs it, client PTO retransmits the second fragment with the server's SCID as DCID — and asserts the server never exceeds 3× amplification across the full paced burst.

codecov · 2026-04-10T13:03:07Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 94.43%. Comparing base (4e35fb3) to head (a498aa1).
⚠️ Report is 14 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3552      +/-   ##
==========================================
- Coverage   94.53%   94.43%   -0.11%     
==========================================
  Files         127      131       +4     
  Lines       39632    39954     +322     
  Branches    39632    39954     +322     
==========================================
+ Hits        37466    37729     +263     
- Misses       1329     1378      +49     
- Partials      837      847      +10

Flag	Coverage Δ
freebsd	`93.49% <100.00%> (-0.10%)`	⬇️
linux	`94.49% <100.00%> (+<0.01%)`	⬆️
macos	`94.43% <100.00%> (-0.01%)`	⬇️
windows	`94.49% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
neqo-common	`98.61% <ø> (ø)`
neqo-crypto	`87.08% <ø> (ø)`
neqo-http3	`93.92% <ø> (ø)`
neqo-qpack	`95.14% <ø> (ø)`
neqo-transport	`95.56% <100.00%> (+<0.01%)`	⬆️
neqo-udp	`84.52% <ø> (-0.38%)`	⬇️
mtu	`86.61% <ø> (ø)`

github-actions

Dismissing empty pending review to resubmit with inline comments.

github-actions

Clean, well-motivated fix. Removing the DCID-based path validation from postprocess_packet is the right call — RFC 9000 §8.1 designates Handshake packets as the address validation mechanism, and Initial keys are publicly derivable, so accepting an Initial DCID match was weaker than intended.

The production change is minimal and correct. The new test directly reproduces the split-Initial scenario and checks the 3× invariant. Two minor suggestions inline.

Copilot

Pull request overview

This PR fixes QUIC server address validation so that it no longer lifts the anti-amplification limit based on a retransmitted client Initial whose DCID matches the server’s Initial SCID; instead, only incoming Handshake packets validate the server’s path during connection setup (per RFC 9000 §8.1).

Changes:

Restrict setup-time path validation on the server to Handshake packets only (removes DCID/SCID matching shortcut).
Add a regression test that reproduces the “MLKEM-split ClientHello + retransmitted Initial uses server SCID as DCID” scenario and asserts the server never exceeds 3× amplification.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`neqo-transport/src/connection/mod.rs`	Removes DCID-based setup path validation so amplification limits aren’t dropped on retransmitted Initials.
`neqo-transport/src/connection/tests/handshake.rs`	Adds a targeted regression test ensuring amplification limits remain enforced across the retransmit scenario.

Copilot · 2026-04-10T13:10:42Z

+        loop {
+            match server.process_output(*now) {
+                Output::Datagram(d) => n += d.len(),
+                Output::Callback(t) if t < AT_LEAST_PTO => *now += t,


The comment says this drains only pacing delays and stops before PTOs fire, but the t < AT_LEAST_PTO condition will also advance time across any timer shorter than 1s (including PTO in this test suite, e.g. 300ms). That can change the scenario by potentially triggering PTO probes during drain. Either update the comment to match the behavior, or tighten the condition to only advance across pacing-scale delays (e.g., using rtt::GRANULARITY).

Suggested change

Output::Callback(t) if t < AT_LEAST_PTO => *now += t,

Output::Callback(t) if t < Duration::from_millis(1) => *now += t,

github-actions

Correct fix. Removing the DCID-based path validation from postprocess_packet is the right approach — RFC 9000 §8.1 designates Handshake packets as the address validation signal during connection setup. Initial packets don't prove the client received the server's TLS messages (Initial keys are publicly derivable from the DCID), so the SCID-matching shortcut was unsound as an amplification-limit bypass.

The production change is minimal (2 lines removed, no new code paths) and no other call sites use local_initial_source_cid for path validation, so the fix is complete.

The regression test directly reproduces the MLKEM split-Initial retransmit scenario and verifies the 3× invariant. The c_init2 assertion from the previous review round has been addressed.

Two prior suggestions remain open and are still valid:

The drain closure's AT_LEAST_PTO threshold (1s) is significantly wider than the "sub-millisecond pacing delays" the comment describes — this won't break the test (the server is amplification-limited regardless), but tightening the guard or updating the comment would improve clarity.
Asserting server.state() != Confirmed at the end would guard against future changes that accidentally validate the path through a different code path.

The server was validating the client's address — and dropping the amplification limit — upon receiving a retransmitted client Initial whose DCID matched the server's chosen source CID. RFC 9000 Section 8.1 designates Handshake packets as the validation mechanism; the CID-based path (a MAY) is weaker here because Initial keys are publicly derivable, so the SCID is visible to any on-path observer. In practice, with MLKEM the ClientHello spans two Initial packets. If only the first reaches the server (as in the QNS amplificationlimit test with drops_to_server=2,3,4,5,6,7), the server sends PTO probes that exhaust most of the amplification budget. When the client retransmits the second fragment using the server's SCID as DCID, the old code validated the path and let the server send the full ~12 KB Handshake flight unrestricted — over 4× amplification.

github-actions

Correct fix. The production change is minimal and well-targeted: removing the DCID-based path validation from postprocess_packet is the right approach per RFC 9000 §8.1. Initial keys are publicly derivable from the DCID, so a matching SCID on a retransmitted Initial does not prove the client received the server's TLS messages — only a Handshake packet does. No other call sites use local_initial_source_cid for path validation, so the fix is complete.

The regression test directly reproduces the MLKEM split-Initial retransmit scenario and checks the 3× invariant. The c_init2 assertion from the earlier review round has been addressed. Two minor suggestions inline (both echoing prior reviewers): tighten the drain loop guard to match its documented "pacing only" intent, and add a State::Confirmed assertion as a second safety net.

github-actions · 2026-04-10T15:28:37Z

+    // Advance through pacing delays (sub-millisecond) but stop before PTOs fire.
+    let drain = |server: &mut Connection, now: &mut _| {
+        let mut n = 0;
+        loop {
+            match server.process_output(*now) {
+                Output::Datagram(d) => n += d.len(),
+                Output::Callback(t) if t < AT_LEAST_PTO => *now += t,
+                _ => break,
+            }
+        }
+        n
+    };


Warning

AT_LEAST_PTO is 1 s, which exceeds the initial PTO (~300 ms with DEFAULT_RTT = 100 ms). The drain loop will happily advance through a PTO timer and trigger probe retransmissions, inflating sent beyond what pacing alone would produce. This doesn't break the test (the server is amplification-limited regardless), but the comment is misleading and the loop does more than advertised.

Either tighten the guard to truly only cover pacing delays, or update the comment. Using a sub-millisecond ceiling is cleanest since pacing intervals at this scale are microsecond-order:

Suggested change

// Advance through pacing delays (sub-millisecond) but stop before PTOs fire.

let drain = |server: &mut Connection, now: &mut _| {

let mut n = 0;

loop {

match server.process_output(*now) {

Output::Datagram(d) => n += d.len(),

Output::Callback(t) if t < AT_LEAST_PTO => *now += t,

_ => break,

}

}

n

};

// Advance through pacing delays (sub-millisecond) but stop before any

// larger timer (PTO, idle, etc.) fires.

let drain = |server: &mut Connection, now: &mut _| {

let mut n = 0;

loop {

match server.process_output(*now) {

Output::Datagram(d) => n += d.len(),

Output::Callback(t) if t < Duration::from_millis(1) => *now += t,

_ => break,

}

}

n

};

github-actions · 2026-04-10T15:28:42Z

+    assert!(
+        sent <= received * 3,
+        "server sent {sent} bytes but received only {received} bytes (limit is 3x)",
+    );
+}


Note

+1 to the earlier suggestion to assert the server hasn't reached Confirmed. Without it, a future change that accidentally validates the path through another code path would let the server send unrestricted — the 3× assertion could still pass trivially, and the test would silently stop covering the amplification limit.

Suggested change

assert!(

sent <= received * 3,

"server sent {sent} bytes but received only {received} bytes (limit is 3x)",

);

}

assert!(

sent <= received * 3,

"server sent {sent} bytes but received only {received} bytes (limit is 3x)",

);

assert_ne!(

*server.state(),

State::Confirmed,

"path should not be validated yet — only a Handshake packet should do that"

);

}

github-actions · 2026-04-10T16:00:30Z

Failed Interop Tests

QUIC Interop Runner, client vs. server, differences relative to main at a4cb75b.

neqo-pr as client	neqo-pr as server
neqo-pr vs. go-x-net: BP BA neqo-pr vs. haproxy: BP BA neqo-pr vs. kwik: S L1 C1 BP BA neqo-pr vs. lsquic: run cancelled after 20 min neqo-pr vs. msquic: A L1 C1 neqo-pr vs. mvfst: H DC LR M R Z 3 B U A L1 L2 C1 C2 6 BP BA neqo-pr vs. neqo: Z A neqo-pr vs. nginx: BP BA neqo-pr vs. ngtcp2: CM neqo-pr vs. picoquic: A neqo-pr vs. quic-go: A neqo-pr vs. quiche: BP BA neqo-pr vs. s2n-quic: BP 🚀BA CM neqo-pr vs. tquic: S BP BA neqo-pr vs. xquic: A 🚀L1 C1	aioquic vs. neqo-pr: Z CM go-x-net vs. neqo-pr: CM kwik vs. neqo-pr: Z BP BA CM lsquic vs. neqo-pr: Z msquic vs. neqo-pr: Z CM mvfst vs. neqo-pr: Z A L1 C1 CM neqo vs. neqo-pr: Z 🚀A openssl vs. neqo-pr: LR M 🚀A CM picoquic vs. neqo-pr: Z quic-go vs. neqo-pr: ⚠️Z CM quiche vs. neqo-pr: Z CM quinn vs. neqo-pr: Z V2 CM s2n-quic vs. neqo-pr: ⚠️B CM tquic vs. neqo-pr: Z CM xquic vs. neqo-pr: M CM

All results

Succeeded Interop Tests

QUIC Interop Runner, client vs. server

neqo-pr as client

neqo-pr vs. aioquic: H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6 V2 BP BA
neqo-pr vs. go-x-net: H DC LR M B U A L2 C2 6
neqo-pr vs. haproxy: H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6 V2
neqo-pr vs. kwik: H DC LR C20 M R Z 3 B U A L2 C2 6 V2
neqo-pr vs. linuxquic: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
neqo-pr vs. msquic: H DC LR C20 M S R Z B U L2 C2 6 V2 BP BA
neqo-pr vs. neqo: H DC LR C20 M S R 3 B U E L1 L2 C1 C2 6 V2 BP BA CM
neqo-pr vs. nginx: H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6
neqo-pr vs. ngtcp2: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA
neqo-pr vs. picoquic: H DC LR C20 M S R Z 3 B U E L1 L2 C1 C2 6 V2 BP BA CM
neqo-pr vs. quic-go: H DC LR C20 M S R Z 3 B U L1 L2 C1 C2 6 BP BA
neqo-pr vs. quiche: H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6
neqo-pr vs. quinn: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 BP BA
neqo-pr vs. s2n-quic: H DC LR C20 M S R 3 B U E A L1 L2 C1 C2 6 🚀BA
neqo-pr vs. tquic: H DC LR C20 M R Z 3 B U A L1 L2 C1 C2 6
neqo-pr vs. xquic: H DC LR C20 M S R Z 3 B U 🚀L1 L2 C2 6 BP BA

neqo-pr as server

aioquic vs. neqo-pr: H DC LR C20 M S R 3 B U A L1 L2 C1 C2 6 V2 BP BA
chrome vs. neqo-pr: 3
go-x-net vs. neqo-pr: H DC LR M B U A L2 C2 6 BP BA
kwik vs. neqo-pr: H DC LR C20 M S R 3 B U A L1 L2 C1 C2 6 V2
linuxquic vs. neqo-pr: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
lsquic vs. neqo-pr: H DC LR C20 M S R 3 B E A L1 L2 C1 C2 6 V2 BP BA CM
msquic vs. neqo-pr: H DC LR C20 M S R B U A L1 L2 C1 C2 6 V2 BP BA
mvfst vs. neqo-pr: H DC LR M 3 B L2 C2 6 BP BA
neqo vs. neqo-pr: H DC LR C20 M S R 3 B U E 🚀A L1 L2 C1 C2 6 V2 BP BA CM
ngtcp2 vs. neqo-pr: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
openssl vs. neqo-pr: H DC C20 S R 3 B 🚀A L2 C2 6 BP BA
picoquic vs. neqo-pr: H DC LR C20 M S R 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
quic-go vs. neqo-pr: H DC LR C20 M S R ⚠️Z 3 B U A L1 L2 C1 C2 6 BP BA
quiche vs. neqo-pr: H DC LR M S R 3 B A L1 L2 C1 C2 6 BP BA
quinn vs. neqo-pr: H DC LR C20 M S R 3 B U E A L1 L2 C1 C2 6 BP BA
s2n-quic vs. neqo-pr: H DC LR M S R 3 ⚠️B E A L1 L2 C1 C2 6 BP BA
tquic vs. neqo-pr: H DC LR M S R 3 B A L1 L2 C1 C2 6 BP BA
xquic vs. neqo-pr: H DC LR C20 S R Z 3 B U A L1 L2 C1 C2 6 BP BA

Unsupported Interop Tests

QUIC Interop Runner, client vs. server

neqo-pr as client

neqo-pr vs. aioquic: E CM
neqo-pr vs. go-x-net: C20 S R Z 3 E L1 C1 V2 CM
neqo-pr vs. haproxy: E CM
neqo-pr vs. kwik: E CM
neqo-pr vs. msquic: 3 E CM
neqo-pr vs. mvfst: C20 S E V2 CM
neqo-pr vs. nginx: E V2 CM
neqo-pr vs. quic-go: E V2 CM
neqo-pr vs. quiche: E V2 CM
neqo-pr vs. quinn: V2 CM
neqo-pr vs. s2n-quic: Z V2
neqo-pr vs. tquic: E V2 CM
neqo-pr vs. xquic: E V2 CM

neqo-pr as server

aioquic vs. neqo-pr: E
chrome vs. neqo-pr: H DC LR C20 M S R Z B U E A L1 L2 C1 C2 6 V2 BP BA CM
go-x-net vs. neqo-pr: C20 S R Z 3 E L1 C1 V2
kwik vs. neqo-pr: E
lsquic vs. neqo-pr: U
msquic vs. neqo-pr: 3 E
mvfst vs. neqo-pr: C20 S R U E V2
openssl vs. neqo-pr: Z U E L1 C1 V2
quic-go vs. neqo-pr: E V2
quiche vs. neqo-pr: C20 U E V2
s2n-quic vs. neqo-pr: C20 Z U V2
tquic vs. neqo-pr: C20 U E V2
xquic vs. neqo-pr: E V2

github-actions · 2026-04-10T16:04:01Z

Client/server transfer results

Performance differences relative to a4cb75b.

Transfer of 33554432 bytes over loopback, min. 100 runs. All unit-less numbers are in milliseconds.

Client vs. server (params)	Mean ± σ	Min	Max	MiB/s ± σ	Δ `baseline`	Δ `baseline`
neqo-neqo-cubic	98.1 ± 4.2	88.6	107.5	326.3 ± 7.6	💔 1.2	1.2%
neqo-neqo-newreno	94.2 ± 3.6	87.4	106.4	339.5 ± 8.9	💚 -2.3	-2.4%
neqo-neqo-newreno-nopacing	95.9 ± 4.3	85.5	105.0	333.9 ± 7.4	💚 -2.4	-2.4%
neqo-quiche-cubic	189.0 ± 2.9	185.4	199.2	169.3 ± 11.0	💚 -1.2	-0.7%

Table above only shows statistically significant changes. See all results below.

All results

Transfer of 33554432 bytes over loopback, min. 100 runs. All unit-less numbers are in milliseconds.

Client vs. server (params)	Mean ± σ	Min	Max	MiB/s ± σ	Δ `baseline`	Δ `baseline`
google-google-nopacing	450.4 ± 4.3	443.7	464.1	71.1 ± 7.4
google-neqo-cubic	270.3 ± 4.5	260.6	278.0	118.4 ± 7.1	0.0	0.0%
msquic-msquic-nopacing	184.5 ± 86.1	120.4	489.9	173.4 ± 0.4
msquic-neqo-cubic	205.9 ± 89.8	134.5	609.9	155.4 ± 0.4	-0.4	-0.2%
neqo-google-cubic	752.4 ± 6.0	744.2	786.5	42.5 ± 5.3	1.4	0.2%
neqo-msquic-cubic	152.9 ± 4.5	147.6	161.3	209.3 ± 7.1	-1.2	-0.8%
neqo-neqo-cubic	98.1 ± 4.2	88.6	107.5	326.3 ± 7.6	💔 1.2	1.2%
neqo-neqo-cubic-nopacing	96.5 ± 4.6	86.8	110.3	331.6 ± 7.0	0.5	0.6%
neqo-neqo-newreno	94.2 ± 3.6	87.4	106.4	339.5 ± 8.9	💚 -2.3	-2.4%
neqo-neqo-newreno-nopacing	95.9 ± 4.3	85.5	105.0	333.9 ± 7.4	💚 -2.4	-2.4%
neqo-quiche-cubic	189.0 ± 2.9	185.4	199.2	169.3 ± 11.0	💚 -1.2	-0.7%
neqo-s2n-cubic	219.5 ± 4.5	212.7	234.0	145.8 ± 7.1	-0.2	-0.1%
quiche-neqo-cubic	180.9 ± 5.1	169.9	193.6	176.9 ± 6.3	0.1	0.1%
quiche-quiche-nopacing	142.9 ± 3.9	135.7	153.5	224.0 ± 8.2
s2n-neqo-cubic	219.9 ± 4.2	211.8	230.4	145.5 ± 7.6	-0.1	-0.1%
s2n-s2n-nopacing	295.6 ± 26.2	281.2	397.9	108.2 ± 1.2

Download data for profiler.firefox.com or download performance comparison data.

github-actions · 2026-04-10T16:06:12Z

Benchmark results

Significant performance differences relative to 7c0b85f.

transfer/1-conn/1-100mb-req (aka. Upload)/mtu-1504: 💔 Performance has regressed by +1.6375%.

       time:   [207.96 ms 208.33 ms 208.71 ms]
       thrpt:  [479.13 MiB/s 480.01 MiB/s 480.86 MiB/s]
change:
       time:   [+1.3886% +1.6375% +1.8998] (p = 0.00 < 0.05)
       thrpt:  [-1.8644% -1.6112% -1.3695]
       Performance has regressed.
Found 5 outliers among 100 measurements (5.00%)
1 (1.00%) low mild
4 (4.00%) high mild

All results

transfer/1-conn/1-100mb-resp (aka. Download)/mtu-1504: Change within noise threshold.

       time:   [204.06 ms 204.43 ms 204.79 ms]
       thrpt:  [488.31 MiB/s 489.18 MiB/s 490.06 MiB/s]
change:
       time:   [+0.4868% +0.7369% +0.9890] (p = 0.00 < 0.05)
       thrpt:  [-0.9793% -0.7315% -0.4844]
       Change within noise threshold.
Found 4 outliers among 100 measurements (4.00%)
3 (3.00%) low mild
1 (1.00%) high mild

transfer/1-conn/10_000-parallel-1b-resp (aka. RPS)/mtu-1504: No change in performance detected.

       time:   [285.46 ms 287.18 ms 288.88 ms]
       thrpt:  [34.616 Kelem/s 34.821 Kelem/s 35.031 Kelem/s]
change:
       time:   [-1.2251% -0.3415% +0.5381] (p = 0.46 > 0.05)
       thrpt:  [-0.5352% +0.3427% +1.2403]
       No change in performance detected.

transfer/1-conn/1-1b-resp (aka. HPS)/mtu-1504: No change in performance detected.

       time:   [38.490 ms 38.640 ms 38.809 ms]
       thrpt:  [25.767   B/s 25.880   B/s 25.981   B/s]
change:
       time:   [-0.7318% -0.1507% +0.4306] (p = 0.62 > 0.05)
       thrpt:  [-0.4288% +0.1509% +0.7372]
       No change in performance detected.
Found 7 outliers among 100 measurements (7.00%)
1 (1.00%) low mild
2 (2.00%) high mild
4 (4.00%) high severe

transfer/1-conn/1-100mb-req (aka. Upload)/mtu-1504: 💔 Performance has regressed by +1.6375%.

       time:   [207.96 ms 208.33 ms 208.71 ms]
       thrpt:  [479.13 MiB/s 480.01 MiB/s 480.86 MiB/s]
change:
       time:   [+1.3886% +1.6375% +1.8998] (p = 0.00 < 0.05)
       thrpt:  [-1.8644% -1.6112% -1.3695]
       Performance has regressed.
Found 5 outliers among 100 measurements (5.00%)
1 (1.00%) low mild
4 (4.00%) high mild

streams/walltime/1-streams/each-1000-bytes: Change within noise threshold.

       time:   [587.42 µs 588.82 µs 590.60 µs]
       change: [-0.9947% -0.5317% -0.1066] (p = 0.02 < 0.05)
       Change within noise threshold.
Found 6 outliers among 100 measurements (6.00%)
2 (2.00%) high mild
4 (4.00%) high severe

streams/walltime/1000-streams/each-1-bytes: No change in performance detected.

       time:   [12.335 ms 12.370 ms 12.421 ms]
       change: [-0.1944% +0.1459% +0.6107] (p = 0.52 > 0.05)
       No change in performance detected.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high severe

streams/walltime/1000-streams/each-1000-bytes: Change within noise threshold.

       time:   [45.187 ms 45.231 ms 45.275 ms]
       change: [+0.5951% +0.9286% +1.1555] (p = 0.00 < 0.05)
       Change within noise threshold.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high mild

transfer/walltime/pacing-false/varying-seeds: Change within noise threshold.

       time:   [81.384 ms 81.477 ms 81.566 ms]
       change: [-0.3721% -0.2324% -0.1001] (p = 0.00 < 0.05)
       Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
2 (2.00%) low severe
3 (3.00%) low mild
1 (1.00%) high mild
2 (2.00%) high severe

transfer/walltime/pacing-true/varying-seeds: Change within noise threshold.

       time:   [81.153 ms 81.215 ms 81.282 ms]
       change: [-0.3313% -0.2189% -0.1054] (p = 0.00 < 0.05)
       Change within noise threshold.
Found 4 outliers among 100 measurements (4.00%)
3 (3.00%) high mild
1 (1.00%) high severe

transfer/walltime/pacing-false/same-seed: Change within noise threshold.

       time:   [79.569 ms 79.711 ms 79.865 ms]
       change: [-0.7318% -0.5494% -0.3571] (p = 0.00 < 0.05)
       Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
13 (13.00%) high mild

transfer/walltime/pacing-true/same-seed: Change within noise threshold.

       time:   [81.685 ms 81.798 ms 81.920 ms]
       change: [+0.1853% +0.3393% +0.5004] (p = 0.00 < 0.05)
       Change within noise threshold.
Found 10 outliers among 100 measurements (10.00%)
5 (5.00%) high mild
5 (5.00%) high severe

Download data for profiler.firefox.com or download performance comparison data.

martinthomson

Removing a condition that provides path validation in order to make tests pass is backwards. The fix is to ensure that the test correctly measures the thing we care about.

larseggert · 2026-04-13T05:27:27Z

@marten-seemann has been sitting on quic-interop/quic-interop-runner#442 for a while and I got impatient with the spurious failures.

github-actions

The security analysis in the PR description is correct: Initial keys are publicly derivable, so a DCID match on a retransmitted Initial is a weaker address-validation signal than a Handshake packet. However, as @martinthomson noted, removing the DCID-based path validation to satisfy a test inverts the causality — the question is what behavior is correct, not what makes a test pass.

The core issue is that set_valid() is binary: the DCID match was granting usize::MAX amplification budget — identical to Handshake validation. A middle path exists: keep the DCID recognition but apply a bounded budget increase instead of full validation. This would preserve the latency benefit in MLKEM split-Initial scenarios while still bounding amplification.

Two open suggestions from prior reviewers remain valid on the test:

The drain closure uses AT_LEAST_PTO (1s) as its threshold, which exceeds the actual PTO (~300ms) and will advance through PTO timers — misleading relative to its "sub-millisecond pacing" comment. Should be tightened to ~1ms or the comment updated.
An assert_ne!(*server.state(), State::Confirmed) at the end would guard against the 3× check passing trivially if a future change accidentally validates the path.

github-actions · 2026-04-15T13:57:55Z

-            && (packet.packet_type() == packet::Type::Handshake
-                || (packet.dcid().len() >= 8 && packet.dcid() == self.local_initial_source_cid))
-        {
+        } else if self.role != Role::Client && packet.packet_type() == packet::Type::Handshake {


Caution

Agreeing with @martinthomson's review — the directionality here needs reconsideration.

The removed DCID condition was the only pre-Handshake path validation the server had. set_valid() sets ProbeState::Valid, which makes amplification_limit() return usize::MAX (path.rs:1089). So this was an all-or-nothing switch: the DCID match — a weaker signal than a Handshake — was granting the same unlimited sending privilege as Handshake receipt.

The security argument in the PR description is sound (Initial keys are publicly derivable, so an on-path observer who sees the server's Initial learns the SCID and could forge a matching DCID). But if the DCID-based check is considered valuable for performance (it avoids an extra RTT in the MLKEM split-Initial case where the server has a large Handshake flight), the fix might be to keep the check but not call set_valid() — instead, extend the amplification budget (e.g., increase the multiplier) without fully validating the path. That would preserve the latency benefit while still bounding amplification.

As it stands, removing the condition entirely means the server will be 3×-limited for an additional RTT whenever the ClientHello is split across two Initials and the second is lost, which is the common case with MLKEM + lossy networks.

larseggert requested review from KershawChang, martinthomson and mxinden as code owners April 10, 2026 13:00

Copilot AI review requested due to automatic review settings April 10, 2026 13:00

Copilot started reviewing on behalf of larseggert April 10, 2026 13:01 View session