Pull requests: pytorch/ao
Pull requests list
fix(utils): propagate non_blocking in TorchAOBaseTensor._to_copy and _get_to_kwargs
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#4297
opened Apr 19, 2026 by
Dev-next-gen
small fix to low precision attention benchmarking to force baseline contexts
CLA Signed
module: not user facing
Use this tag if you don't want this PR to show up in release notes
#4296
opened Apr 18, 2026 by
howardzhang-cv
Contributor
add FSDP and TP tests for Float8BlockwiseLinear
CLA Signed
#4295
opened Apr 17, 2026 by
iamzainhuda
Contributor
•
Draft
Move collect_producer_nodes to graph_utils.py
CLA Signed
#4294
opened Apr 17, 2026 by
tom-arm
Contributor
[mxfp8 moe training] fuse dynamic per-group padding into cutedsl 2d mxfp8 quant kernel
CLA Signed
#4293
opened Apr 17, 2026 by
MagellaX
Contributor
basic enablement for mxfp8 and mxfp4 inference on AMD MI350x
CLA Signed
module: inference
quantize_ api inference flow
#4290
opened Apr 16, 2026 by
vkuzo
Contributor
Add iterative reweighting to PruneOptimizer
CLA Signed
module: not user facing
#4283
opened Apr 15, 2026 by
lisjin
Contributor
[AARCH64] Enable MKLDNN Backend for Int8DynamicActivationInt8WeightConfig() on ARM
CLA Signed
#4281
opened Apr 15, 2026 by
agrawal-aka
Contributor
Add Sparse2x4HIPSPARSELTFloat8Tensor (#4277)
CLA Signed
fb-exported
meta-exported
module: rocm
#4277
opened Apr 14, 2026 by
bbeckca
Contributor
[ROCm] Add MXFP8 training support for gfx950 (MI355X)
ciflow/rocm-mi300
CLA Signed
module: rocm
[xpu][mx][test] Enable mx serialization tests on xpu
CLA Signed
#4272
opened Apr 13, 2026 by
ugolowic
Contributor
[xpu][mx] Fix NaN scale propagation in RCEIL triton kernel
CLA Signed
#4271
opened Apr 13, 2026 by
ugolowic
Contributor
[optim] Add GrokAdamW optimizer with low-bit quantization support
CLA Signed
#4270
opened Apr 13, 2026 by
vaibhavhariram
Add torch.uint16, torch.uint32
CLA Signed
#4269
opened Apr 12, 2026 by
Freed-Wu
Add reduce_range to avoid overflow in int8 tensor
CLA Signed
module: not user facing
#4266
opened Apr 10, 2026 by
cyxlily
Contributor
Support 32x32 scaling for weights in MXFP8 weight quantization kernel
CLA Signed
module: training
quantize_ api training flow
moe
mx
#4254
opened Apr 9, 2026 by
alexsamardzic
Collaborator
[nvfp4 training] add autograd support for NVFP4 emulated grouped GEMM
CLA Signed
#4252
opened Apr 9, 2026 by
roycho96
Contributor
[xpu][mx] Enable mx matmul tests on xpu
CLA Signed
#4251
opened Apr 8, 2026 by
ugolowic
Contributor
[ROCm] Switch CI to allowlist of validated tests
CLA Signed
module: not user facing
topic: rocm
#4248
opened Apr 7, 2026 by
brucechanglongxu
Contributor
[nvfp4_training] Add Triton kernel for global amax of columnwise RHT (SM90+)
CLA Signed
module: training
quantize_ api training flow
nvfp4
#4247
opened Apr 7, 2026 by
rdspring1
2 tasks
Build GPU profile benchmark
CLA Signed
module: inference
quantize_ api inference flow
topic: for developers
Use this tag if this PR is mainly developer facing
#4244
opened Apr 6, 2026 by
namgyu-youn
Contributor
5 tasks
[test] Refactor test_quant_api to use shared ToyTwoLinearModel
CLA Signed
#4236
opened Apr 3, 2026 by
abhinav-TB
Move BlockSparseWeightConfig to prototype and deprecate torchao.sparsity imports (#4230)
CLA Signed
module: deprecation
Use this tag if this PR deprecates a feature
#4232
opened Apr 3, 2026 by
jerryzh168
Contributor
Fix #4106: ./test/float8/test_everything_multi_gpu.sh failing in 4xH1...
CLA Signed
#4222
opened Apr 2, 2026 by
JiwaniZakir
Add aten.view.dtype dispatch to optim state subclasses
CLA Signed
module: training
quantize_ api training flow
#4216
opened Apr 1, 2026 by
winglian