-
Notifications
You must be signed in to change notification settings - Fork 476
[Tracker] Expanded BF16 Support #9728
Copy link
Copy link
Open
Description
Tracking issue for expanding BF16 support. See #9599 for initial discussion.
- Support bf16 <-> fp32 convert ops.
- Scalar: Add scalar f32-bf16-vcvt support #9727
- Neon (w/o FEAT_BF16)
- Neon (w/ FEAT_BF16)
- SSE/AVX (target TBD)
- AVX512_BF16
- Support bf16 <-> qs/qu8 convert ops.
- Scalar
- Neon (w/o FEAT_BF16)
- Neon (w/ FEAT_BF16)
- SSE/AVX (target TBD)
- AVX512_BF16
- Update src/qs8-gemm/ to add qd8-bf16 variants.
- Scalar
- SSE2
- AVX2
- AVX512
- AVX512_BF16
- Neon (w/o FEAT_BF16)
- Neon (w/ FEAT_BF16)
- Wire up bf16 in the subgraph layer for supported operators.
- Support subgraph rewrite (bf16 -> fp32) for ops without native bf16 support.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels