Run TorchTitan GraphTrainer AutoParallel CI #452
Draft
sanketpurandare wants to merge 1 commit into
sanketpurandare added a commit that referenced this pull request on May 8, 2026
Extend the TorchTitan integration workflow to run the GraphTrainer AutoParallel integration tests for Llama3 FSDP+TP and DeepSeek V3 EFSDP+EP.

Also run the GraphTrainer AutoParallel numerics tests for Llama3 and DeepSeek V3. The DeepSeek V3 commands disable NCCL NVLS to match the stable TorchTitan numerics setup on the four-GPU AutoParallel CI runner.

stack-info: PR: #452, branch: sanketpurandare/stack/8
Stacked PRs:
Run TorchTitan GraphTrainer AutoParallel CI
Extend the TorchTitan integration workflow to run the GraphTrainer AutoParallel integration tests for Llama3 FSDP+TP and DeepSeek V3 EFSDP+EP.
Also run the GraphTrainer AutoParallel numerics tests for Llama3 and DeepSeek V3. The DeepSeek V3 commands disable NCCL NVLS to match the stable TorchTitan numerics setup on the four-GPU AutoParallel CI runner.
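
For readers unfamiliar with the NVLS switch, here is a minimal sketch of how a single command can be run with NCCL NVLS disabled. It assumes the documented NCCL environment variable `NCCL_NVLS_ENABLE` is the mechanism used. The `run_without_nvls` helper and the `sh -c` command are hypothetical placeholders, since the actual DeepSeek V3 test invocations are not shown in this description.

```shell
# Hypothetical helper: run one command with NCCL NVLS turned off.
# NCCL_NVLS_ENABLE is a documented NCCL environment variable; 0 disables NVLS.
run_without_nvls() {
  env NCCL_NVLS_ENABLE=0 "$@"
}

# Placeholder command standing in for the real DeepSeek V3 test invocation,
# which this PR description does not spell out.
run_without_nvls sh -c 'echo "NCCL_NVLS_ENABLE=$NCCL_NVLS_ENABLE"'
```

Scoping the variable to the individual command, rather than exporting it for the whole job, would leave the Llama3 runs on the NCCL default.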