-
Notifications
You must be signed in to change notification settings - Fork 35
verify memory usage: Unpack hints h on the fly #897
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
This commit splits up the signature unpacking into two parts: unpack_sig_c_z and unpack_sig_h. This allows delaying unpacking of the hints h until later during the verification which in turn allows re-using the buffer used for the matrix. This cuts memory consumption by L KiB. - Hoisted out from #751 Signed-off-by: Matthias J. Kannwischer <matthias@kannwischer.eu>
fa2a1b1 to
f6503be
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mac Mini (M1, 2020) benchmarks (opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
46204 cycles |
46204 cycles |
1 |
ML-DSA-44 sign |
131270 cycles |
131278 cycles |
1.00 |
ML-DSA-44 verify |
47776 cycles |
47765 cycles |
1.00 |
ML-DSA-65 keypair |
81015 cycles |
81031 cycles |
1.00 |
ML-DSA-65 sign |
215698 cycles |
215800 cycles |
1.00 |
ML-DSA-65 verify |
80361 cycles |
80058 cycles |
1.00 |
ML-DSA-87 keypair |
132160 cycles |
132172 cycles |
1.00 |
ML-DSA-87 sign |
276871 cycles |
276892 cycles |
1.00 |
ML-DSA-87 verify |
130562 cycles |
130410 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mac Mini (M1, 2020) benchmarks (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
114193 cycles |
114156 cycles |
1.00 |
ML-DSA-44 sign |
418150 cycles |
418052 cycles |
1.00 |
ML-DSA-44 verify |
122661 cycles |
122255 cycles |
1.00 |
ML-DSA-65 keypair |
195474 cycles |
195490 cycles |
1.00 |
ML-DSA-65 sign |
682605 cycles |
682460 cycles |
1.00 |
ML-DSA-65 verify |
198053 cycles |
197755 cycles |
1.00 |
ML-DSA-87 keypair |
322541 cycles |
322642 cycles |
1.00 |
ML-DSA-87 sign |
864349 cycles |
864696 cycles |
1.00 |
ML-DSA-87 verify |
328718 cycles |
328644 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arm Cortex-A76 (Raspberry Pi 5) benchmarks (opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
113417 cycles |
113348 cycles |
1.00 |
ML-DSA-44 sign |
356123 cycles |
355932 cycles |
1.00 |
ML-DSA-44 verify |
117980 cycles |
118018 cycles |
1.00 |
ML-DSA-65 keypair |
196577 cycles |
196546 cycles |
1.00 |
ML-DSA-65 sign |
589169 cycles |
589059 cycles |
1.00 |
ML-DSA-65 verify |
194566 cycles |
194746 cycles |
1.00 |
ML-DSA-87 keypair |
322422 cycles |
322709 cycles |
1.00 |
ML-DSA-87 sign |
752735 cycles |
753072 cycles |
1.00 |
ML-DSA-87 verify |
319942 cycles |
320128 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Intel Xeon 4th gen (c7i)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
35141 cycles |
34372 cycles |
1.02 |
ML-DSA-44 sign |
120136 cycles |
120027 cycles |
1.00 |
ML-DSA-44 verify |
38187 cycles |
38103 cycles |
1.00 |
ML-DSA-65 keypair |
60745 cycles |
60659 cycles |
1.00 |
ML-DSA-65 sign |
200543 cycles |
200397 cycles |
1.00 |
ML-DSA-65 verify |
62781 cycles |
62812 cycles |
1.00 |
ML-DSA-87 keypair |
94010 cycles |
93992 cycles |
1.00 |
ML-DSA-87 sign |
239553 cycles |
238255 cycles |
1.01 |
ML-DSA-87 verify |
96178 cycles |
94920 cycles |
1.01 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Intel Xeon 4th gen (c7i) (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
93587 cycles |
94201 cycles |
0.99 |
ML-DSA-44 sign |
333236 cycles |
334218 cycles |
1.00 |
ML-DSA-44 verify |
99980 cycles |
99867 cycles |
1.00 |
ML-DSA-65 keypair |
159709 cycles |
160034 cycles |
1.00 |
ML-DSA-65 sign |
545589 cycles |
545365 cycles |
1.00 |
ML-DSA-65 verify |
161523 cycles |
160650 cycles |
1.01 |
ML-DSA-87 keypair |
267333 cycles |
268199 cycles |
1.00 |
ML-DSA-87 sign |
706937 cycles |
707903 cycles |
1.00 |
ML-DSA-87 verify |
270917 cycles |
270461 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arm Cortex-A76 (Raspberry Pi 5) benchmarks (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
212633 cycles |
212419 cycles |
1.00 |
ML-DSA-44 sign |
759965 cycles |
759723 cycles |
1.00 |
ML-DSA-44 verify |
228830 cycles |
228844 cycles |
1.00 |
ML-DSA-65 keypair |
380315 cycles |
380545 cycles |
1.00 |
ML-DSA-65 sign |
1253538 cycles |
1252640 cycles |
1.00 |
ML-DSA-65 verify |
371994 cycles |
371680 cycles |
1.00 |
ML-DSA-87 keypair |
604406 cycles |
604445 cycles |
1.00 |
ML-DSA-87 sign |
1597737 cycles |
1594215 cycles |
1.00 |
ML-DSA-87 verify |
617054 cycles |
618697 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
AMD EPYC 3rd gen (c6a)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
68963 cycles |
68940 cycles |
1.00 |
ML-DSA-44 sign |
187390 cycles |
187846 cycles |
1.00 |
ML-DSA-44 verify |
69100 cycles |
69282 cycles |
1.00 |
ML-DSA-65 keypair |
119833 cycles |
119446 cycles |
1.00 |
ML-DSA-65 sign |
300090 cycles |
301247 cycles |
1.00 |
ML-DSA-65 verify |
115173 cycles |
115447 cycles |
1.00 |
ML-DSA-87 keypair |
203799 cycles |
202992 cycles |
1.00 |
ML-DSA-87 sign |
394125 cycles |
393314 cycles |
1.00 |
ML-DSA-87 verify |
194984 cycles |
195680 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Intel Xeon 3rd gen (c6i)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
56510 cycles |
57044 cycles |
0.99 |
ML-DSA-44 sign |
181510 cycles |
181848 cycles |
1.00 |
ML-DSA-44 verify |
61226 cycles |
61082 cycles |
1.00 |
ML-DSA-65 keypair |
98821 cycles |
98510 cycles |
1.00 |
ML-DSA-65 sign |
298523 cycles |
299264 cycles |
1.00 |
ML-DSA-65 verify |
100377 cycles |
100372 cycles |
1.00 |
ML-DSA-87 keypair |
152801 cycles |
152400 cycles |
1.00 |
ML-DSA-87 sign |
354475 cycles |
355097 cycles |
1.00 |
ML-DSA-87 verify |
153732 cycles |
153647 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
AMD EPYC 4th gen (c7a)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
41579 cycles |
41768 cycles |
1.00 |
ML-DSA-44 sign |
134840 cycles |
134643 cycles |
1.00 |
ML-DSA-44 verify |
44687 cycles |
44801 cycles |
1.00 |
ML-DSA-65 keypair |
72607 cycles |
73157 cycles |
0.99 |
ML-DSA-65 sign |
215866 cycles |
217074 cycles |
0.99 |
ML-DSA-65 verify |
75115 cycles |
73831 cycles |
1.02 |
ML-DSA-87 keypair |
107561 cycles |
108344 cycles |
0.99 |
ML-DSA-87 sign |
251470 cycles |
252157 cycles |
1.00 |
ML-DSA-87 verify |
109366 cycles |
110026 cycles |
0.99 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
AMD EPYC 3rd gen (c6a) (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
134907 cycles |
134815 cycles |
1.00 |
ML-DSA-44 sign |
523677 cycles |
524851 cycles |
1.00 |
ML-DSA-44 verify |
147315 cycles |
147535 cycles |
1.00 |
ML-DSA-65 keypair |
226868 cycles |
226816 cycles |
1.00 |
ML-DSA-65 sign |
860135 cycles |
861519 cycles |
1.00 |
ML-DSA-65 verify |
234724 cycles |
234798 cycles |
1.00 |
ML-DSA-87 keypair |
371386 cycles |
372163 cycles |
1.00 |
ML-DSA-87 sign |
1083150 cycles |
1081624 cycles |
1.00 |
ML-DSA-87 verify |
383992 cycles |
383383 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Intel Xeon 3rd gen (c6i) (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
157793 cycles |
158774 cycles |
0.99 |
ML-DSA-44 sign |
549648 cycles |
553689 cycles |
0.99 |
ML-DSA-44 verify |
169942 cycles |
170266 cycles |
1.00 |
ML-DSA-65 keypair |
268891 cycles |
268386 cycles |
1.00 |
ML-DSA-65 sign |
907860 cycles |
904101 cycles |
1.00 |
ML-DSA-65 verify |
275702 cycles |
274733 cycles |
1.00 |
ML-DSA-87 keypair |
448617 cycles |
448848 cycles |
1.00 |
ML-DSA-87 sign |
1158163 cycles |
1158629 cycles |
1.00 |
ML-DSA-87 verify |
460215 cycles |
459042 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Graviton4
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
68464 cycles |
68285 cycles |
1.00 |
ML-DSA-44 sign |
201880 cycles |
201946 cycles |
1.00 |
ML-DSA-44 verify |
70902 cycles |
70642 cycles |
1.00 |
ML-DSA-65 keypair |
121022 cycles |
121073 cycles |
1.00 |
ML-DSA-65 sign |
330661 cycles |
331230 cycles |
1.00 |
ML-DSA-65 verify |
117884 cycles |
117970 cycles |
1.00 |
ML-DSA-87 keypair |
198310 cycles |
198361 cycles |
1.00 |
ML-DSA-87 sign |
426820 cycles |
426811 cycles |
1.00 |
ML-DSA-87 verify |
194029 cycles |
194224 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
AMD EPYC 4th gen (c7a) (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
120072 cycles |
121340 cycles |
0.99 |
ML-DSA-44 sign |
447197 cycles |
450417 cycles |
0.99 |
ML-DSA-44 verify |
130084 cycles |
131705 cycles |
0.99 |
ML-DSA-65 keypair |
204835 cycles |
206768 cycles |
0.99 |
ML-DSA-65 sign |
729102 cycles |
730995 cycles |
1.00 |
ML-DSA-65 verify |
210498 cycles |
211843 cycles |
0.99 |
ML-DSA-87 keypair |
337641 cycles |
340681 cycles |
0.99 |
ML-DSA-87 sign |
923064 cycles |
929434 cycles |
0.99 |
ML-DSA-87 verify |
346540 cycles |
347157 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Graviton3
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
72250 cycles |
72206 cycles |
1.00 |
ML-DSA-44 sign |
212030 cycles |
212085 cycles |
1.00 |
ML-DSA-44 verify |
75754 cycles |
75732 cycles |
1.00 |
ML-DSA-65 keypair |
127381 cycles |
127427 cycles |
1.00 |
ML-DSA-65 sign |
350265 cycles |
350098 cycles |
1.00 |
ML-DSA-65 verify |
125532 cycles |
125335 cycles |
1.00 |
ML-DSA-87 keypair |
205895 cycles |
208160 cycles |
0.99 |
ML-DSA-87 sign |
444237 cycles |
448888 cycles |
0.99 |
ML-DSA-87 verify |
205727 cycles |
205075 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Graviton4 (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
128554 cycles |
128290 cycles |
1.00 |
ML-DSA-44 sign |
447128 cycles |
447683 cycles |
1.00 |
ML-DSA-44 verify |
138110 cycles |
138350 cycles |
1.00 |
ML-DSA-65 keypair |
220383 cycles |
220301 cycles |
1.00 |
ML-DSA-65 sign |
726363 cycles |
727535 cycles |
1.00 |
ML-DSA-65 verify |
222829 cycles |
223194 cycles |
1.00 |
ML-DSA-87 keypair |
365105 cycles |
365100 cycles |
1.00 |
ML-DSA-87 sign |
926679 cycles |
926074 cycles |
1.00 |
ML-DSA-87 verify |
372574 cycles |
372724 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Graviton2
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
114074 cycles |
113737 cycles |
1.00 |
ML-DSA-44 sign |
356790 cycles |
356305 cycles |
1.00 |
ML-DSA-44 verify |
118382 cycles |
118106 cycles |
1.00 |
ML-DSA-65 keypair |
196886 cycles |
196661 cycles |
1.00 |
ML-DSA-65 sign |
590040 cycles |
589237 cycles |
1.00 |
ML-DSA-65 verify |
194865 cycles |
194763 cycles |
1.00 |
ML-DSA-87 keypair |
323008 cycles |
323538 cycles |
1.00 |
ML-DSA-87 sign |
753229 cycles |
754524 cycles |
1.00 |
ML-DSA-87 verify |
320370 cycles |
320437 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Graviton3 (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
138678 cycles |
138511 cycles |
1.00 |
ML-DSA-44 sign |
483594 cycles |
484146 cycles |
1.00 |
ML-DSA-44 verify |
148450 cycles |
148699 cycles |
1.00 |
ML-DSA-65 keypair |
242267 cycles |
242282 cycles |
1.00 |
ML-DSA-65 sign |
792332 cycles |
792625 cycles |
1.00 |
ML-DSA-65 verify |
240741 cycles |
241147 cycles |
1.00 |
ML-DSA-87 keypair |
395864 cycles |
396266 cycles |
1.00 |
ML-DSA-87 sign |
1012467 cycles |
1012387 cycles |
1.00 |
ML-DSA-87 verify |
402710 cycles |
402523 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
oqs-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Graviton2 (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
212963 cycles |
212757 cycles |
1.00 |
ML-DSA-44 sign |
760596 cycles |
760583 cycles |
1.00 |
ML-DSA-44 verify |
229256 cycles |
229095 cycles |
1.00 |
ML-DSA-65 keypair |
380644 cycles |
380969 cycles |
1.00 |
ML-DSA-65 sign |
1252949 cycles |
1254036 cycles |
1.00 |
ML-DSA-65 verify |
372381 cycles |
371934 cycles |
1.00 |
ML-DSA-87 keypair |
605930 cycles |
604475 cycles |
1.00 |
ML-DSA-87 sign |
1596320 cycles |
1595118 cycles |
1.00 |
ML-DSA-87 verify |
618008 cycles |
618516 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
SpacemiT K1 8 (Banana Pi F3) benchmarks (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
829217 cycles |
827059 cycles |
1.00 |
ML-DSA-44 sign |
3235321 cycles |
3231963 cycles |
1.00 |
ML-DSA-44 verify |
922407 cycles |
922300 cycles |
1.00 |
ML-DSA-65 keypair |
1408325 cycles |
1412898 cycles |
1.00 |
ML-DSA-65 sign |
5337316 cycles |
5354291 cycles |
1.00 |
ML-DSA-65 verify |
1478483 cycles |
1480666 cycles |
1.00 |
ML-DSA-87 keypair |
2309529 cycles |
2312958 cycles |
1.00 |
ML-DSA-87 sign |
6650066 cycles |
6668645 cycles |
1.00 |
ML-DSA-87 verify |
2408322 cycles |
2409942 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arm Cortex-A72 (Raspberry Pi 4) benchmarks (opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
229171 cycles |
228876 cycles |
1.00 |
ML-DSA-44 sign |
629704 cycles |
614814 cycles |
1.02 |
ML-DSA-44 verify |
234317 cycles |
217070 cycles |
1.08 |
ML-DSA-65 keypair |
386832 cycles |
394718 cycles |
0.98 |
ML-DSA-65 sign |
1018531 cycles |
991561 cycles |
1.03 |
ML-DSA-65 verify |
370168 cycles |
376152 cycles |
0.98 |
ML-DSA-87 keypair |
658975 cycles |
646097 cycles |
1.02 |
ML-DSA-87 sign |
1406876 cycles |
1332158 cycles |
1.06 |
ML-DSA-87 verify |
647532 cycles |
625886 cycles |
1.03 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
⚠️ Performance Alert ⚠️
Possible performance regression was detected for benchmark 'Arm Cortex-A72 (Raspberry Pi 4) benchmarks (opt)'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.03.
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 verify |
234317 cycles |
217070 cycles |
1.08 |
ML-DSA-87 sign |
1406876 cycles |
1332158 cycles |
1.06 |
ML-DSA-87 verify |
647532 cycles |
625886 cycles |
1.03 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arm Cortex-A55 (Snapdragon 888) benchmarks (opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
272086 cycles |
273072 cycles |
1.00 |
ML-DSA-44 sign |
799452 cycles |
805254 cycles |
0.99 |
ML-DSA-44 verify |
272361 cycles |
273857 cycles |
0.99 |
ML-DSA-65 keypair |
468476 cycles |
468169 cycles |
1.00 |
ML-DSA-65 sign |
1317665 cycles |
1319237 cycles |
1.00 |
ML-DSA-65 verify |
451223 cycles |
449707 cycles |
1.00 |
ML-DSA-87 keypair |
805715 cycles |
804538 cycles |
1.00 |
ML-DSA-87 sign |
1808353 cycles |
1809292 cycles |
1.00 |
ML-DSA-87 verify |
778420 cycles |
779109 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arm Cortex-A72 (Raspberry Pi 4) benchmarks (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
309867 cycles |
312735 cycles |
0.99 |
ML-DSA-44 sign |
1154040 cycles |
1199505 cycles |
0.96 |
ML-DSA-44 verify |
333978 cycles |
345578 cycles |
0.97 |
ML-DSA-65 keypair |
565534 cycles |
582435 cycles |
0.97 |
ML-DSA-65 sign |
1932069 cycles |
1976366 cycles |
0.98 |
ML-DSA-65 verify |
526835 cycles |
554587 cycles |
0.95 |
ML-DSA-87 keypair |
856162 cycles |
873951 cycles |
0.98 |
ML-DSA-87 sign |
2425164 cycles |
2499834 cycles |
0.97 |
ML-DSA-87 verify |
882491 cycles |
890351 cycles |
0.99 |
This comment was automatically generated by workflow using github-action-benchmark.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Arm Cortex-A55 (Snapdragon 888) benchmarks (no-opt)
Details
| Benchmark suite | Current: f6503be | Previous: 8933db9 | Ratio |
|---|---|---|---|
ML-DSA-44 keypair |
462570 cycles |
461655 cycles |
1.00 |
ML-DSA-44 sign |
2138045 cycles |
2136241 cycles |
1.00 |
ML-DSA-44 verify |
547668 cycles |
547092 cycles |
1.00 |
ML-DSA-65 keypair |
774877 cycles |
774826 cycles |
1.00 |
ML-DSA-65 sign |
3499244 cycles |
3500411 cycles |
1.00 |
ML-DSA-65 verify |
849141 cycles |
850053 cycles |
1.00 |
ML-DSA-87 keypair |
1251533 cycles |
1249873 cycles |
1.00 |
ML-DSA-87 sign |
4339029 cycles |
4335460 cycles |
1.00 |
ML-DSA-87 verify |
1364449 cycles |
1366205 cycles |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
CBMC Results (ML-DSA-44)
Full Results (173 proofs)
|
CBMC Results (ML-DSA-87)
Full Results (173 proofs)
|
CBMC Results (ML-DSA-65)
Full Results (173 proofs)
|
This commit splits up the signature unpacking into two parts: unpack_sig_c_z and unpack_sig_h.
This allows delaying unpacking of the hints h until later during the verification which in turn allows re-using the buffer used for the matrix. This cuts memory consumption by L KiB.
crypto_sign_verify_internalstack usage #751