Skip to content

Pull requests: intel/auto-round

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

adapt vllm_ext to new extra config
#1409 opened Feb 5, 2026 by mengniwang95 Loading…
1 of 6 tasks
0.10.0
Support Qwen3 Omni model quantization
#1404 opened Feb 4, 2026 by lvliang-intel Draft
2 of 9 tasks
Refactor evaluation in tests to use evaluate_accuracy function
#1402 opened Feb 4, 2026 by xin3he Loading…
1 of 9 tasks
1.0.0
support gpt-oss mxfp4 directly loading
#1401 opened Feb 4, 2026 by xin3he Loading…
1 of 9 tasks
Optimize CPU RAM peak memory during quantization
#1386 opened Feb 3, 2026 by lvliang-intel Loading…
3 of 9 tasks
refactor init of compressor engineering ready only add when the PR is ready to merge
#1339 opened Jan 26, 2026 by n1ck-guo Loading…
1 of 9 tasks
Add asym for XPU backend.
#1316 opened Jan 22, 2026 by luoyu-intel Draft
Update torch to 2.9.1 in CI
#1313 opened Jan 22, 2026 by XuehaoSun Loading…
Fix ignore_layers not working for FP8 models
#1286 opened Jan 15, 2026 by Copilot AI Loading…
11 tasks done
fix disable_opt_rtn spelling error
#1250 opened Jan 9, 2026 by WeiweiZhang1 Loading…
add per-task lm_eval args for exprimental usage
#1017 opened Nov 11, 2025 by WeiweiZhang1 Loading…
[WIP] [STEP 2] split compressor into few quantizers
#841 opened Sep 23, 2025 by n1ck-guo Loading…
ProTip! Add no:assignee to see everything that’s not assigned.