Avx2 keccakx4 trivial loop bench #1549
Mac Mini (M1, 2020) benchmarks
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 12327 cycles | 12325 cycles | 1.00 |
| ML-KEM-512 encaps | 15030 cycles | 15029 cycles | 1.00 |
| ML-KEM-512 decaps | 19606 cycles | 19606 cycles | 1 |
| ML-KEM-768 keypair | 21089 cycles | 21089 cycles | 1 |
| ML-KEM-768 encaps | 23851 cycles | 23852 cycles | 1.00 |
| ML-KEM-768 decaps | 30431 cycles | 30427 cycles | 1.00 |
| ML-KEM-1024 keypair | 30389 cycles | 30388 cycles | 1.00 |
| ML-KEM-1024 encaps | 34647 cycles | 34647 cycles | 1 |
| ML-KEM-1024 decaps | 44274 cycles | 44273 cycles | 1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
ppc64le (POWER10) benchmarks
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 59545 cycles | 59172 cycles | 1.01 |
| ML-KEM-512 encaps | 72248 cycles | 71904 cycles | 1.00 |
| ML-KEM-512 decaps | 91935 cycles | 91485 cycles | 1.00 |
| ML-KEM-768 keypair | 99436 cycles | 98898 cycles | 1.01 |
| ML-KEM-768 encaps | 115919 cycles | 115162 cycles | 1.01 |
| ML-KEM-768 decaps | 141731 cycles | 140953 cycles | 1.01 |
| ML-KEM-1024 keypair | 150862 cycles | 148300 cycles | 1.02 |
| ML-KEM-1024 encaps | 170096 cycles | 167223 cycles | 1.02 |
| ML-KEM-1024 decaps | 201671 cycles | 197970 cycles | 1.02 |
Arm Cortex-A55 (Snapdragon 888) benchmarks
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 59523 cycles | 59521 cycles | 1.00 |
| ML-KEM-512 encaps | 67226 cycles | 67229 cycles | 1.00 |
| ML-KEM-512 decaps | 85809 cycles | 85804 cycles | 1.00 |
| ML-KEM-768 keypair | 97056 cycles | 96994 cycles | 1.00 |
| ML-KEM-768 encaps | 110877 cycles | 110345 cycles | 1.00 |
| ML-KEM-768 decaps | 137492 cycles | 137732 cycles | 1.00 |
| ML-KEM-1024 keypair | 154014 cycles | 153929 cycles | 1.00 |
| ML-KEM-1024 encaps | 171068 cycles | 172288 cycles | 0.99 |
| ML-KEM-1024 decaps | 206658 cycles | 207746 cycles | 0.99 |
Arm Cortex-A76 (Raspberry Pi 5) benchmarks
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 28352 cycles | 28397 cycles | 1.00 |
| ML-KEM-512 encaps | 34316 cycles | 34356 cycles | 1.00 |
| ML-KEM-512 decaps | 44535 cycles | 44625 cycles | 1.00 |
| ML-KEM-768 keypair | 47860 cycles | 47827 cycles | 1.00 |
| ML-KEM-768 encaps | 54218 cycles | 54220 cycles | 1.00 |
| ML-KEM-768 decaps | 68712 cycles | 68666 cycles | 1.00 |
| ML-KEM-1024 keypair | 70593 cycles | 70592 cycles | 1.00 |
| ML-KEM-1024 encaps | 79130 cycles | 79148 cycles | 1.00 |
| ML-KEM-1024 decaps | 98827 cycles | 98830 cycles | 1.00 |
SpacemiT K1 8 (Banana Pi F3) benchmarks
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 155135 cycles | 155095 cycles | 1.00 |
| ML-KEM-512 encaps | 163327 cycles | 163265 cycles | 1.00 |
| ML-KEM-512 decaps | 206580 cycles | 206458 cycles | 1.00 |
| ML-KEM-768 keypair | 249525 cycles | 249494 cycles | 1.00 |
| ML-KEM-768 encaps | 270302 cycles | 270253 cycles | 1.00 |
| ML-KEM-768 decaps | 332113 cycles | 332056 cycles | 1.00 |
| ML-KEM-1024 keypair | 395104 cycles | 395171 cycles | 1.00 |
| ML-KEM-1024 encaps | 423669 cycles | 423760 cycles | 1.00 |
| ML-KEM-1024 decaps | 505496 cycles | 505500 cycles | 1.00 |
Arm Cortex-A72 (Raspberry Pi 4) benchmarks
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 50814 cycles | 50728 cycles | 1.00 |
| ML-KEM-512 encaps | 59134 cycles | 58464 cycles | 1.01 |
| ML-KEM-512 decaps | 75596 cycles | 74142 cycles | 1.02 |
| ML-KEM-768 keypair | 87232 cycles | 86364 cycles | 1.01 |
| ML-KEM-768 encaps | 95628 cycles | 94317 cycles | 1.01 |
| ML-KEM-768 decaps | 118116 cycles | 117130 cycles | 1.01 |
| ML-KEM-1024 keypair | 131476 cycles | 130149 cycles | 1.01 |
| ML-KEM-1024 encaps | 143198 cycles | 142791 cycles | 1.00 |
| ML-KEM-1024 decaps | 172979 cycles | 174398 cycles | 0.99 |
CBMC Results (ML-KEM-512): Full Results (151 proofs)
CBMC Results (ML-KEM-768): Full Results (151 proofs)
CBMC Results (ML-KEM-1024): Full Results (151 proofs)
Signed-off-by: manastasova <manastasova2017@fau.edu>
265a377 to af21fb3
oqs-bot
left a comment
Intel Xeon 4th gen (c7i)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 9556 cycles | 9673 cycles | 0.99 |
| ML-KEM-512 encaps | 10932 cycles | 11477 cycles | 0.95 |
| ML-KEM-512 decaps | 15058 cycles | 15381 cycles | 0.98 |
| ML-KEM-768 keypair | 16392 cycles | 16417 cycles | 1.00 |
| ML-KEM-768 encaps | 17697 cycles | 17862 cycles | 0.99 |
| ML-KEM-768 decaps | 23875 cycles | 23516 cycles | 1.02 |
| ML-KEM-1024 keypair | 22305 cycles | 22324 cycles | 1.00 |
| ML-KEM-1024 encaps | 24559 cycles | 24598 cycles | 1.00 |
| ML-KEM-1024 decaps | 32295 cycles | 32492 cycles | 0.99 |
oqs-bot
left a comment
AMD EPYC 3rd gen (c6a)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 14065 cycles | 16694 cycles | 0.84 |
| ML-KEM-512 encaps | 15769 cycles | 18347 cycles | 0.86 |
| ML-KEM-512 decaps | 21130 cycles | 23698 cycles | 0.89 |
| ML-KEM-768 keypair | 23751 cycles | 28491 cycles | 0.83 |
| ML-KEM-768 encaps | 25032 cycles | 29745 cycles | 0.84 |
| ML-KEM-768 decaps | 32982 cycles | 37597 cycles | 0.88 |
| ML-KEM-1024 keypair | 33466 cycles | 41238 cycles | 0.81 |
| ML-KEM-1024 encaps | 35796 cycles | 43448 cycles | 0.82 |
| ML-KEM-1024 decaps | 46212 cycles | 53882 cycles | 0.86 |
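For reading the Ratio column: the rows above are consistent with Ratio being the current cycle count divided by the previous one, so values below 1.00 mean the current commit needs fewer cycles. The sketch below recomputes the keypair ratios from the AMD EPYC 3rd gen (c6a) numbers. The `ratio` helper and the `rows` dictionary are illustrative names and not part of the benchmark tooling, and the formula is inferred from the reported values rather than taken from the workflow itself.

```python
def ratio(current_cycles: int, previous_cycles: int) -> float:
    """Assumed definition of the Ratio column: current / previous."""
    return current_cycles / previous_cycles


# (current, previous) cycle counts copied from the c6a table above.
rows = {
    "ML-KEM-512 keypair": (14065, 16694),
    "ML-KEM-768 keypair": (23751, 28491),
    "ML-KEM-1024 keypair": (33466, 41238),
}

for name, (current, previous) in rows.items():
    r = ratio(current, previous)
    saved = 100 * (1 - r)  # percentage of cycles saved relative to previous
    print(f"{name}: ratio {r:.2f}, {saved:.1f}% fewer cycles")
```

Under this reading, a ratio of 0.84 corresponds to roughly 16% fewer cycles than the previous commit on that machine.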
oqs-bot
left a comment
Intel Xeon 4th gen (c7i) (no-opt)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 28458 cycles | 28486 cycles | 1.00 |
| ML-KEM-512 encaps | 35805 cycles | 35832 cycles | 1.00 |
| ML-KEM-512 decaps | 45572 cycles | 45388 cycles | 1.00 |
| ML-KEM-768 keypair | 45946 cycles | 45961 cycles | 1.00 |
| ML-KEM-768 encaps | 55956 cycles | 56259 cycles | 0.99 |
| ML-KEM-768 decaps | 69443 cycles | 69350 cycles | 1.00 |
| ML-KEM-1024 keypair | 71747 cycles | 71758 cycles | 1.00 |
| ML-KEM-1024 encaps | 84456 cycles | 84551 cycles | 1.00 |
| ML-KEM-1024 decaps | 101536 cycles | 101569 cycles | 1.00 |
oqs-bot
left a comment
Intel Xeon 3rd gen (c6i)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 14466 cycles | 16360 cycles | 0.88 |
| ML-KEM-512 encaps | 16857 cycles | 18696 cycles | 0.90 |
| ML-KEM-512 decaps | 23388 cycles | 25288 cycles | 0.92 |
| ML-KEM-768 keypair | 25398 cycles | 29401 cycles | 0.86 |
| ML-KEM-768 encaps | 26818 cycles | 29789 cycles | 0.90 |
| ML-KEM-768 decaps | 36512 cycles | 39271 cycles | 0.93 |
| ML-KEM-1024 keypair | 35940 cycles | 37691 cycles | 0.95 |
| ML-KEM-1024 encaps | 36379 cycles | 40668 cycles | 0.89 |
| ML-KEM-1024 decaps | 49505 cycles | 54398 cycles | 0.91 |
oqs-bot
left a comment
AMD EPYC 4th gen (c7a)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 10566 cycles | 11999 cycles | 0.88 |
| ML-KEM-512 encaps | 12186 cycles | 13180 cycles | 0.92 |
| ML-KEM-512 decaps | 17026 cycles | 18037 cycles | 0.94 |
| ML-KEM-768 keypair | 18422 cycles | 20544 cycles | 0.90 |
| ML-KEM-768 encaps | 19458 cycles | 21544 cycles | 0.90 |
| ML-KEM-768 decaps | 26625 cycles | 28655 cycles | 0.93 |
| ML-KEM-1024 keypair | 24696 cycles | 27796 cycles | 0.89 |
| ML-KEM-1024 encaps | 26983 cycles | 29927 cycles | 0.90 |
| ML-KEM-1024 decaps | 36303 cycles | 39285 cycles | 0.92 |
oqs-bot
left a comment
AMD EPYC 3rd gen (c6a) (no-opt)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 40291 cycles | 40267 cycles | 1.00 |
| ML-KEM-512 encaps | 48317 cycles | 48316 cycles | 1.00 |
| ML-KEM-512 decaps | 62449 cycles | 62424 cycles | 1.00 |
| ML-KEM-768 keypair | 63569 cycles | 63564 cycles | 1.00 |
| ML-KEM-768 encaps | 74735 cycles | 74898 cycles | 1.00 |
| ML-KEM-768 decaps | 93150 cycles | 93130 cycles | 1.00 |
| ML-KEM-1024 keypair | 94966 cycles | 94896 cycles | 1.00 |
| ML-KEM-1024 encaps | 109013 cycles | 108876 cycles | 1.00 |
| ML-KEM-1024 decaps | 131917 cycles | 131809 cycles | 1.00 |
oqs-bot
left a comment
Graviton3
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 18745 cycles | 18742 cycles | 1.00 |
| ML-KEM-512 encaps | 22006 cycles | 22003 cycles | 1.00 |
| ML-KEM-512 decaps | 29029 cycles | 29023 cycles | 1.00 |
| ML-KEM-768 keypair | 31777 cycles | 31761 cycles | 1.00 |
| ML-KEM-768 encaps | 34923 cycles | 34920 cycles | 1.00 |
| ML-KEM-768 decaps | 45031 cycles | 45029 cycles | 1.00 |
| ML-KEM-1024 keypair | 46378 cycles | 46373 cycles | 1.00 |
| ML-KEM-1024 encaps | 51720 cycles | 51719 cycles | 1.00 |
| ML-KEM-1024 decaps | 65247 cycles | 65250 cycles | 1.00 |
oqs-bot
left a comment
AMD EPYC 4th gen (c7a) (no-opt)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 36516 cycles | 36497 cycles | 1.00 |
| ML-KEM-512 encaps | 43033 cycles | 42982 cycles | 1.00 |
| ML-KEM-512 decaps | 55702 cycles | 55646 cycles | 1.00 |
| ML-KEM-768 keypair | 58639 cycles | 58420 cycles | 1.00 |
| ML-KEM-768 encaps | 67395 cycles | 67359 cycles | 1.00 |
| ML-KEM-768 decaps | 84468 cycles | 84372 cycles | 1.00 |
| ML-KEM-1024 keypair | 88706 cycles | 88675 cycles | 1.00 |
| ML-KEM-1024 encaps | 98904 cycles | 98961 cycles | 1.00 |
| ML-KEM-1024 decaps | 120395 cycles | 120433 cycles | 1.00 |
oqs-bot
left a comment
Intel Xeon 3rd gen (c6i) (no-opt)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 45808 cycles | 45849 cycles | 1.00 |
| ML-KEM-512 encaps | 54744 cycles | 54779 cycles | 1.00 |
| ML-KEM-512 decaps | 70267 cycles | 70327 cycles | 1.00 |
| ML-KEM-768 keypair | 73772 cycles | 73822 cycles | 1.00 |
| ML-KEM-768 encaps | 85228 cycles | 85296 cycles | 1.00 |
| ML-KEM-768 decaps | 106252 cycles | 106294 cycles | 1.00 |
| ML-KEM-1024 keypair | 111786 cycles | 111742 cycles | 1.00 |
| ML-KEM-1024 encaps | 125899 cycles | 125872 cycles | 1.00 |
| ML-KEM-1024 decaps | 151997 cycles | 151691 cycles | 1.00 |
oqs-bot
left a comment
Graviton3 (no-opt)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 39060 cycles | 39071 cycles | 1.00 |
| ML-KEM-512 encaps | 44638 cycles | 44643 cycles | 1.00 |
| ML-KEM-512 decaps | 56702 cycles | 56716 cycles | 1.00 |
| ML-KEM-768 keypair | 62443 cycles | 62450 cycles | 1.00 |
| ML-KEM-768 encaps | 70937 cycles | 70948 cycles | 1.00 |
| ML-KEM-768 decaps | 86874 cycles | 86891 cycles | 1.00 |
| ML-KEM-1024 keypair | 96266 cycles | 96265 cycles | 1.00 |
| ML-KEM-1024 encaps | 106329 cycles | 106329 cycles | 1 |
| ML-KEM-1024 decaps | 126776 cycles | 126794 cycles | 1.00 |
oqs-bot
left a comment
Graviton2
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 28353 cycles | 28351 cycles | 1.00 |
| ML-KEM-512 encaps | 34324 cycles | 34327 cycles | 1.00 |
| ML-KEM-512 decaps | 44558 cycles | 44552 cycles | 1.00 |
| ML-KEM-768 keypair | 47850 cycles | 47859 cycles | 1.00 |
| ML-KEM-768 encaps | 54200 cycles | 54212 cycles | 1.00 |
| ML-KEM-768 decaps | 68726 cycles | 68737 cycles | 1.00 |
| ML-KEM-1024 keypair | 70659 cycles | 70597 cycles | 1.00 |
| ML-KEM-1024 encaps | 79239 cycles | 79162 cycles | 1.00 |
| ML-KEM-1024 decaps | 99009 cycles | 98902 cycles | 1.00 |
oqs-bot
left a comment
Graviton4
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 17765 cycles | 17725 cycles | 1.00 |
| ML-KEM-512 encaps | 20770 cycles | 20732 cycles | 1.00 |
| ML-KEM-512 decaps | 27205 cycles | 27157 cycles | 1.00 |
| ML-KEM-768 keypair | 30095 cycles | 30056 cycles | 1.00 |
| ML-KEM-768 encaps | 32881 cycles | 32832 cycles | 1.00 |
| ML-KEM-768 decaps | 42235 cycles | 42114 cycles | 1.00 |
| ML-KEM-1024 keypair | 44036 cycles | 43983 cycles | 1.00 |
| ML-KEM-1024 encaps | 49079 cycles | 49010 cycles | 1.00 |
| ML-KEM-1024 decaps | 61941 cycles | 61573 cycles | 1.01 |
oqs-bot
left a comment
Graviton4 (no-opt)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 35500 cycles | 35503 cycles | 1.00 |
| ML-KEM-512 encaps | 40226 cycles | 40237 cycles | 1.00 |
| ML-KEM-512 decaps | 51217 cycles | 51238 cycles | 1.00 |
| ML-KEM-768 keypair | 56803 cycles | 56812 cycles | 1.00 |
| ML-KEM-768 encaps | 64724 cycles | 64720 cycles | 1.00 |
| ML-KEM-768 decaps | 79038 cycles | 79042 cycles | 1.00 |
| ML-KEM-1024 keypair | 88036 cycles | 88032 cycles | 1.00 |
| ML-KEM-1024 encaps | 97177 cycles | 97178 cycles | 1.00 |
| ML-KEM-1024 decaps | 116101 cycles | 116101 cycles | 1 |
oqs-bot
left a comment
Graviton2 (no-opt)
| Benchmark suite | Current: af21fb3 | Previous: 09c31cb | Ratio |
|---|---|---|---|
| ML-KEM-512 keypair | 59177 cycles | 59157 cycles | 1.00 |
| ML-KEM-512 encaps | 68651 cycles | 68643 cycles | 1.00 |
| ML-KEM-512 decaps | 87408 cycles | 87390 cycles | 1.00 |
| ML-KEM-768 keypair | 95658 cycles | 95754 cycles | 1.00 |
| ML-KEM-768 encaps | 109514 cycles | 109601 cycles | 1.00 |
| ML-KEM-768 decaps | 134406 cycles | 134498 cycles | 1.00 |
| ML-KEM-1024 keypair | 146936 cycles | 148574 cycles | 0.99 |
| ML-KEM-1024 encaps | 163016 cycles | 164464 cycles | 0.99 |
| ML-KEM-1024 decaps | 194266 cycles | 195790 cycles | 0.99 |
@manastasova Is this still relevant? Closing for now. Please re-open and comment on your plans for it otherwise.
This commit aims to compare performance with #1521. This commit differs from #1521 by: