intel / auto-round Public

Notifications You must be signed in to change notification settings
Fork 78
Star 845

Code
Issues 85
Pull requests 19
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: intel/auto-round

Labels 28 Milestones 3

New pull request New

19 Open 1,006 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

adapt vllm_ext to new extra config

#1409 opened Feb 5, 2026 by mengniwang95

Loading…

1 of 6 tasks

0.10.0

Support Qwen3 Omni model quantization

#1404 opened Feb 4, 2026 by lvliang-intel • Draft

2 of 9 tasks

Refactor evaluation in tests to use evaluate_accuracy function

#1402 opened Feb 4, 2026 by xin3he

Loading…

1 of 9 tasks

1.0.0

support gpt-oss mxfp4 directly loading

#1401 opened Feb 4, 2026 by xin3he

Loading…

1 of 9 tasks

support multiple device evaluation for activation quantized model

#1394 opened Feb 4, 2026 by wenhuach21

Loading…

9 tasks

[Experimental][Won't Merge] DDP PoC won't merge

#1391 opened Feb 4, 2026 by yiliu30 • Draft

9 tasks

Optimize CPU RAM peak memory during quantization

#1386 opened Feb 3, 2026 by lvliang-intel

Loading…

3 of 9 tasks

Refactor module access to use PyTorch get/set_submodule API

#1365 opened Jan 29, 2026 by scopophobic

Loading…

support hadamard transform for mxfp4 with rtn or autoround method.

#1349 opened Jan 27, 2026 by lkk12014402

Loading…

refactor init of compressor engineering ready

only add when the PR is ready to merge

#1339 opened Jan 26, 2026 by n1ck-guo

Loading…

1 of 9 tasks

Add asym for XPU backend.

#1316 opened Jan 22, 2026 by luoyu-intel • Draft

Update torch to 2.9.1 in CI

#1313 opened Jan 22, 2026 by XuehaoSun

Loading…

align act_max of experts for qwen3-vl and qwen3-next

#1311 opened Jan 21, 2026 by xin3he • Draft

Robust FP8 layer detection for ignore_layers (#1283)

#1289 opened Jan 15, 2026 by scopophobic

Loading…

Fix ignore_layers not working for FP8 models

#1286 opened Jan 15, 2026 by Copilot AI

Loading…

11 tasks done

[WIP][refactor quanizers][step 1] refactor rtn and tuning

#1278 opened Jan 14, 2026 by n1ck-guo

Loading…

fix disable_opt_rtn spelling error

#1250 opened Jan 9, 2026 by WeiweiZhang1

Loading…

add per-task lm_eval args for exprimental usage

#1017 opened Nov 11, 2025 by WeiweiZhang1

Loading…

[WIP] [STEP 2] split compressor into few quantizers

#841 opened Sep 23, 2025 by n1ck-guo

Loading…

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!