Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat(0506): Layerwise export: API-driven, env-var-free, opt-in flag 1.22 Release 1.22 candidate enhancement New feature or request
#1047 opened Jun 5, 2026 by vbaddi Contributor Loading…
1 task done
feat(0506): Add optional KV-cache buffer-name prefix for vLLM disaggr…
#1046 opened Jun 5, 2026 by vbaddi Contributor Loading…
3 tasks done
Reranker single qpc
#1045 opened Jun 5, 2026 by quic-amitraj Contributor Draft
[Nightly-CI]: Adding nightly validation in release/1.22.0_tmp branch
#1044 opened Jun 5, 2026 by abukhoy Contributor Loading…
Fix for logits issue in Qwen3 VL MOE
#1043 opened Jun 5, 2026 by tv-karthikeya Contributor Loading…
KV handoff with DMA slicing APIs to avoid KV input/output copies.
#1039 opened Jun 4, 2026 by quic-akuruvil Contributor Loading…
[EB] Qwen_3_5_Moe 1.22 Release 1.22 candidate
#1038 opened Jun 4, 2026 by mohiso22 Contributor Loading…
Repeatkv transform 1.22 Release 1.22 candidate
#1037 opened Jun 4, 2026 by quic-dhirajku Contributor Loading…
feat(0406): Add Gemma4 Unified vision-language support enhancement New feature or request
#1036 opened Jun 4, 2026 by vbaddi Contributor Loading…
Reranker & Embedding: Qwen3-VL single-shot inference with single-specialization compile 1.22 Release 1.22 candidate embedding This label is for all the PR related to embedding model. reranker This label is for all the PR related to reranker model.
#1031 opened Jun 3, 2026 by quic-amitraj Contributor Loading…
Release/v1.22.0 tmp
#1029 opened Jun 3, 2026 by quic-rishinr Contributor Loading…
fix(0306): MoE prefill reductions for subfunction export 1.22 Release 1.22 candidate bugfix
#1028 opened Jun 3, 2026 by vbaddi Contributor Loading…
ci(0306): speed up QAIC PR tests with safe parallelism enhancement New feature or request
#1025 opened Jun 2, 2026 by vbaddi Contributor Loading…
Feat/moe nsp blocking all models
#1016 opened Jun 1, 2026 by divytrip3005 Loading…
[gh-pages]: Release/v1.21.6 Github page Added
#1015 opened Jun 1, 2026 by abukhoy Contributor Loading…
Add Support for MiniMax 2.5 Model
#1014 opened May 30, 2026 by vbaddi Contributor Draft
feat(skip-softmax): Add skip-softmax support for KV-blocked attention enhancement New feature or request
#1009 opened May 25, 2026 by vbaddi Contributor Draft
Qwen image with magcache Diffusers Use for PR related to diffusers in efficient-transformers. performance
#998 opened May 20, 2026 by quic-amitraj Contributor Loading…
Dflash: Block Diffusion Speculative Decoding
#995 opened May 18, 2026 by vjanfaza Contributor Draft
Ft_v1 QAIC-profiler hotfix
#994 opened May 18, 2026 by quic-akuruvil Contributor Loading…
Magcache support for Diffuser Diffusers Use for PR related to diffusers in efficient-transformers. performance
#993 opened May 18, 2026 by quic-amitraj Contributor Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.