Skip to content

Pull requests: allenai/open-instruct

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add use_cpu_adam flag for GRPO fast training
#1737 opened Jun 24, 2026 by Chessing234 Contributor Loading…
2 tasks
Fix token-weighted loss under gradient accumulation in finetune.py
#1736 opened Jun 24, 2026 by Chessing234 Contributor Loading…
2 tasks
Add tiled GRPO lm-head loss
#1735 opened Jun 23, 2026 by hamishivi Collaborator Loading…
Add tool-schema support to SFT tokenization
#1734 opened Jun 23, 2026 by hamishivi Collaborator Loading…
Add DPPO loss function
#1733 opened Jun 23, 2026 by hamishivi Collaborator Loading…
Make ModelDims.from_hf_config robust to explicit head_dim
#1731 opened Jun 23, 2026 by hamishivi Collaborator Loading…
Increase default environment pool acquire timeout to 7200s
#1729 opened Jun 23, 2026 by hamishivi Collaborator Loading…
Fix get_wandb_tags tag-limit handling and log all GRPO configs
#1727 opened Jun 17, 2026 by mnoukhov Contributor Loading…
Add OLMo-core 32B GRPO launch scripts
#1726 opened Jun 16, 2026 by mnoukhov Contributor Loading…
Now, DPO gets ~32% MFU.
#1720 opened Jun 10, 2026 by finbarrtimbers Collaborator Draft
Now, we support the hybrid model in our Olmo-core code.
#1713 opened Jun 2, 2026 by finbarrtimbers Collaborator Loading…
Wire max_checkpoints through SFT, DPO, and GRPO paths
#1701 opened May 27, 2026 by TimDettmers Loading…
4 tasks
Add olmo-eval Beaker launch integration for GRPO
#1698 opened May 22, 2026 by mnoukhov Contributor Draft
2 of 3 tasks
Add Trackio rollout trace logging
#1697 opened May 21, 2026 by abidlabs Loading…
Adds difficulty sampling curriculum dataloader and dataset builder
#1661 opened May 6, 2026 by undfined Collaborator Loading…
Replace submit_eval_jobs.py with thin wrapper around submit_eval_jobs.sh
#1658 opened May 6, 2026 by finbarrtimbers Collaborator Loading…
2 tasks
Add time/per_group_wall_time metric
#1656 opened May 5, 2026 by finbarrtimbers Collaborator Loading…
2 tasks
Make checkpointing better
#1647 opened Apr 29, 2026 by finbarrtimbers Collaborator Draft
3 tasks
Fix submit_eval_jobs.py for olmo-eval-internal runs
#1644 opened Apr 28, 2026 by finbarrtimbers Collaborator Loading…
4 tasks done
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.