Pull requests: pytorch/ao


Pull requests list

fix(utils): propagate non_blocking in TorchAOBaseTensor._to_copy and _get_to_kwargs
Labels: CLA Signed (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.)
#4297 opened Apr 19, 2026 by Dev-next-gen
Small fix to low precision attention benchmarking to force baseline contexts
Labels: CLA Signed, module: not user facing (Use this tag if you don't want this PR to show up in release notes)
#4296 opened Apr 18, 2026 by howardzhang-cv Contributor
Add FSDP and TP tests for Float8BlockwiseLinear
Labels: CLA Signed
#4295 opened Apr 17, 2026 by iamzainhuda Contributor Draft
Move collect_producer_nodes to graph_utils.py
Labels: CLA Signed
#4294 opened Apr 17, 2026 by tom-arm Contributor
[mxfp8 moe training] fuse dynamic per-group padding into cutedsl 2d mxfp8 quant kernel
Labels: CLA Signed
#4293 opened Apr 17, 2026 by MagellaX Contributor
Basic enablement for mxfp8 and mxfp4 inference on AMD MI350x
Labels: CLA Signed, module: inference (quantize_ api inference flow)
#4290 opened Apr 16, 2026 by vkuzo Contributor
Add iterative reweighting to PruneOptimizer
Labels: CLA Signed, module: not user facing
#4283 opened Apr 15, 2026 by lisjin Contributor
[AARCH64] Enable MKLDNN Backend for Int8DynamicActivationInt8WeightConfig() on ARM
Labels: CLA Signed
#4281 opened Apr 15, 2026 by agrawal-aka Contributor
Add Sparse2x4HIPSPARSELTFloat8Tensor (#4277)
Labels: CLA Signed, fb-exported, meta-exported, module: rocm
#4277 opened Apr 14, 2026 by bbeckca Contributor
[ROCm] Add MXFP8 training support for gfx950 (MI355X)
Labels: ciflow/rocm-mi300, CLA Signed, module: rocm
#4275 opened Apr 14, 2026 by indianspeedster MXFP8 Training
[xpu][mx][test] Enable mx serialization tests on xpu
Labels: CLA Signed
#4272 opened Apr 13, 2026 by ugolowic Contributor
[xpu][mx] Fix NaN scale propagation in RCEIL triton kernel
Labels: CLA Signed
#4271 opened Apr 13, 2026 by ugolowic Contributor
[optim] Add GrokAdamW optimizer with low-bit quantization support
Labels: CLA Signed
#4270 opened Apr 13, 2026 by vaibhavhariram
Add torch.uint16, torch.uint32
Labels: CLA Signed
#4269 opened Apr 12, 2026 by Freed-Wu
Add reduce_range to avoid overflow in int8 tensor
Labels: CLA Signed, module: not user facing
#4266 opened Apr 10, 2026 by cyxlily Contributor
Support 32x32 scaling for weights in MXFP8 weight quantization kernel
Labels: CLA Signed, module: training (quantize_ api training flow), moe, mx
#4254 opened Apr 9, 2026 by alexsamardzic Collaborator
[nvfp4 training] add autograd support for NVFP4 emulated grouped GEMM
Labels: CLA Signed
#4252 opened Apr 9, 2026 by roycho96 Contributor
[xpu][mx] Enable mx matmul tests on xpu
Labels: CLA Signed
#4251 opened Apr 8, 2026 by ugolowic Contributor
[ROCm] Switch CI to allowlist of validated tests
Labels: CLA Signed, module: not user facing, topic: rocm
#4248 opened Apr 7, 2026 by brucechanglongxu Contributor
[nvfp4_training] Add Triton kernel for global amax of columnwise RHT (SM90+)
Labels: CLA Signed, module: training, nvfp4
#4247 opened Apr 7, 2026 by rdspring1
2 tasks
Build GPU profile benchmark
Labels: CLA Signed, module: inference, topic: for developers (Use this tag if this PR is mainly developer facing)
#4244 opened Apr 6, 2026 by namgyu-youn Contributor
5 tasks
[test] Refactor test_quant_api to use shared ToyTwoLinearModel
Labels: CLA Signed
#4236 opened Apr 3, 2026 by abhinav-TB
Move BlockSparseWeightConfig to prototype and deprecate torchao.sparsity imports (#4230)
Labels: CLA Signed, module: deprecation (Use this tag if this PR deprecates a feature)
#4232 opened Apr 3, 2026 by jerryzh168 Contributor
Fix #4106: ./test/float8/test_everything_multi_gpu.sh failing in 4xH1...
Labels: CLA Signed
#4222 opened Apr 2, 2026 by JiwaniZakir
Add aten.view.dtype dispatch to optim state subclasses
Labels: CLA Signed, module: training
#4216 opened Apr 1, 2026 by winglian