Skip to content

Pull requests: radixark/miles

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add chunked TP logprob loss
#1068 opened May 4, 2026 by mouad-hpc Loading…
[DO NOT MERGE] CI test run run-ci-megatron
#1066 opened May 4, 2026 by guapisolo Collaborator Loading…
[TITO] model-support: add DeepSeek V4 TITO support
#1065 opened May 4, 2026 by zyzshishui Contributor Loading…
[TITO] feat (4/N): Use prompt ids from miles session server
#1064 opened May 4, 2026 by zyzshishui Contributor Loading…
Random fully async agent example
#1062 opened May 3, 2026 by jthomy Draft
[feat] Init true on policy with qwen_moe
#1059 opened May 1, 2026 by maocheng23 Contributor Draft
5 tasks
Add --miles-nsa-topk-backend
#1058 opened May 1, 2026 by zianglih Contributor Loading…
[feat] Init true on policy with qwen_dense
#1052 opened Apr 28, 2026 by maocheng23 Contributor Loading…
5 tasks
DeepSeek V4 RL support
#1045 opened Apr 24, 2026 by yueming-yuan Collaborator Loading…
Offload session-server tokenization to a threadpool
#1041 opened Apr 23, 2026 by Shi-Dong Contributor Loading…
3 tasks
[swe-agent-v2] Randomize agent_name per request
#1040 opened Apr 23, 2026 by Shi-Dong Contributor Loading…
2 tasks
[rollout] TITO refactors
#1039 opened Apr 22, 2026 by guapisolo Collaborator Draft
Expose agent-server trial timeout as a configurable launcher flag
#1036 opened Apr 22, 2026 by Shi-Dong Contributor Loading…
2 tasks
feat: rollout-data replay and all-gather CP for megatron runner
#1030 opened Apr 21, 2026 by yueming-yuan Collaborator Loading…
4 tasks
Add GLM-4.7 Full agentic async training launch script
#1029 opened Apr 21, 2026 by Shi-Dong Contributor Loading…
1 of 2 tasks
[rollout] add --max-concurrent-agent-tasks to cap agentic in-flight requests
#1026 opened Apr 21, 2026 by Shi-Dong Contributor Loading…
3 tasks
ProTip! Updated in the last three days: updated:>2026-05-01.