Skip to content

Pull requests: aws-neuron/neuronx-distributed-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix all four scaling multipliers for Granite
#48 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Add ShardedRMSNorm for Q-K normalization under tensor parallelism
#47 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Add NoPE layer support and tied embeddings for SmolLM3-3B
#46 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Add Q-K normalization and scaled embeddings for Gemma-3-1b-it
#45 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Add LongRoPE and fix state dict conversion for Phi-3.5-mini-instruct
#44 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Fix state dict mapping and add partial RoPE for Phi-1.5
#43 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Update GPT-2 with Conv1D transposition and vocab padding
#42 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Update Pythia-2.8B GPTNeoX model with validated accuracy
#41 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Update StableLM-2-1.6B with partial RoPE and LayerNorm
#40 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Fix interleaved RoPE and partial rotary factor for GLM-4
#39 opened Feb 18, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Update OLMo-2-1B-Instruct with ShardedRMSNorm for TP Q-K norm
#38 opened Feb 17, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Enable OnDeviceSamplingConfig for compiler accuracy fix
#37 opened Feb 17, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
Updating AFM results
#36 opened Feb 17, 2026 by sdeeptan-aws Loading…
11 of 14 tasks
[Contribution] Gemma3-27B VLM
#34 opened Feb 5, 2026 by plienhar Loading…
14 tasks done
ProTip! Filter pull requests by the default branch with base:main.