Address "Optimize GF(2^8) multiplication using x86_64 GFNI intrinsics" #27

Merged: itzmeanjan merged 8 commits into main from 26-optimize-gf28-multiplication-using-x86_64-gfni-intrinsics on Sep 22, 2025
@itzmeanjan
Owner

Closes #26

After introducing support for GFNI intrinsics, I'm seeing a modest performance boost on an AWS EC2 m7a.large instance (AMD EPYC 9R14). Nothing too exciting, but I'll keep it.

Erasure-coding throughput was ~50 GB/s before this change; it is now ~60 GB/s 🌟

Screenshot From 2025-09-21 14-46-44
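
For readers unfamiliar with the arithmetic being accelerated, here is a minimal scalar sketch of GF(2^8) multiplication modulo `x^8 + x^4 + x^3 + x + 1`, the polynomial named in this PR's commit messages and the one the GFNI `GF2P8MULB` instruction reduces by. It is a reference model only, not the code in this PR: the names `gf256_mul` and `gf256_mul_bytes` are hypothetical, and the real implementation presumably processes many bytes per instruction with SIMD rather than looping bit by bit.

```python
# Reduction polynomial x^8 + x^4 + x^3 + x + 1, written as a 9-bit mask.
IRREDUCIBLE_POLY = 0x11B


def gf256_mul(a: int, b: int) -> int:
    """Multiply two GF(2^8) elements (hypothetical helper, shift-and-XOR method)."""
    if not (0 <= a <= 0xFF and 0 <= b <= 0xFF):
        raise ValueError("operands must be single bytes")
    product = 0
    for _ in range(8):
        if b & 1:
            product ^= a  # addition in GF(2^8) is XOR
        b >>= 1
        a <<= 1  # multiply a by x
        if a & 0x100:
            a ^= IRREDUCIBLE_POLY  # reduce back to 8 bits
    return product


def gf256_mul_bytes(coeff: int, data: bytes) -> bytes:
    """Scale every byte of `data` by `coeff`, as a coding step would (assumed usage)."""
    return bytes(gf256_mul(coeff, byte) for byte in data)


if __name__ == "__main__":
    print(hex(gf256_mul(0x57, 0x83)))  # prints 0xc1
```

A scalar model like this is mainly useful as a test oracle: any vectorized path can be checked byte for byte against it on random inputs.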

…ucible polynomial `x^8 + x^4 + x^3 + x + 1`

Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
@itzmeanjan self-assigned this on Sep 21, 2025
Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
…EADME

Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
Signed-off-by: Anjan Roy <hello@itzmeanjan.in>
@itzmeanjan merged commit 04d20d6 into main on Sep 22, 2025
12 checks passed
@itzmeanjan deleted the 26-optimize-gf28-multiplication-using-x86_64-gfni-intrinsics branch on September 22, 2025 05:34
Development

Successfully merging this pull request may close these issues.

Optimize GF(2^8) multiplication using x86_64 GFNI intrinsics

1 participant