avx2-optimized

Here are 4 public repositories matching this topic...

IAA03_fast_math is a single-header math kernel (atan2 only for now) designed to eliminate the "Trigonometry Tax" in high-throughput systems (physics engines, audio DSP, and ML pre-processing). It is branchless, exploits instruction-level parallelism (ILP) and SIMD (AVX2/SSE4.1), and achieves up to a ~186x per-element throughput speedup over std::atan2 while remaining IEEE 754 compliant.

  • Updated Jan 14, 2026
  • C++
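
To illustrate the general idea of a branchless atan2 described above, here is a minimal sketch. It is not taken from IAA03_fast_math: the names approx_atan2 and approx_atan2_batch are hypothetical, the polynomial is a commonly used odd fit for atan on [0, 1] (error on the order of 1e-5 rad), and the sketch assumes finite inputs and makes no attempt at the IEEE 754 edge-case handling (signed zeros, infinities, NaN) that the repository claims.

```cpp
#include <cmath>
#include <cstddef>

// Hypothetical helper, NOT the IAA03_fast_math API.
// Approximates atan2(y, x) for finite inputs using selects instead of
// data-dependent control flow, so compilers can usually lower it to
// blend/min/max instructions.
inline float approx_atan2(float y, float x) {
    const float pi      = 3.14159265358979f;
    const float half_pi = 1.57079632679490f;

    const float ax = std::fabs(x);
    const float ay = std::fabs(y);
    const float mx = std::fmax(ax, ay);
    const float mn = std::fmin(ax, ay);

    // Range reduction to a in [0, 1]. The tiny bias avoids 0/0 at the
    // origin; it also degrades accuracy for extremely small magnitudes.
    const float a = mn / (mx + 1e-30f);
    const float s = a * a;

    // Odd polynomial fit of atan(a) on [0, 1], evaluated in Horner form.
    float r = a * (0.99997726f + s * (-0.33262347f + s * (0.19354346f +
              s * (-0.11643287f + s * (0.05265332f + s * -0.01172120f)))));

    r = (ay > ax) ? half_pi - r : r;  // undo the min/max swap
    r = (x < 0.0f) ? pi - r : r;      // move into the left half-plane
    return std::copysign(r, y);       // restore the sign of y
}

// Hypothetical batch wrapper: a plain loop over contiguous arrays, which
// an optimizing compiler may auto-vectorize when targeting AVX2.
inline void approx_atan2_batch(const float* y, const float* x,
                               float* out, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = approx_atan2(y[i], x[i]);
    }
}
```

The per-element function contains no loops or early returns, which is what makes the batch loop a candidate for vectorization. Any speedup depends on the compiler, flags, and hardware, and this sketch should not be expected to reproduce the figures quoted in the repository description.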
