Add use_cpu_adam flag for GRPO fast training by Chessing234 · Pull Request #1737 · allenai/open-instruct

Chessing234 · 2026-06-24T15:31:44Z

Summary

Add --use_cpu_adam CLI flag to grpo_fast.py to use DeepSpeedCPUAdam instead of fused AdamW
Complements existing --deepspeed_offload_optimizer / --deepspeed_offload_param flags for low-VRAM local training

Test plan

Run grpo_fast.py with --use_cpu_adam true --deepspeed_offload_optimizer true on a single GPU
Confirm default behavior unchanged when --use_cpu_adam is omitted

Made with Cursor

Expose DeepSpeedCPUAdam as an optional optimizer when running grpo_fast with CPU optimizer offload on memory-constrained setups. Fixes allenai#1031. Co-authored-by: Cursor <cursoragent@cursor.com>

gemini-code-assist

Code Review

This pull request introduces a new '--use_cpu_adam' flag to 'grpo_fast.py' to support 'DeepSpeedCPUAdam' when training with CPU optimizer offload. The feedback suggests importing 'DeepSpeedCPUAdam' lazily at the point of use rather than at the top level of the file to prevent import-time errors or startup delays in environments where DeepSpeed is not installed or compiled.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-24T15:32:31Z

 with contextlib.suppress(Exception):
    import deepspeed
+    from deepspeed.ops.adam import DeepSpeedCPUAdam
    from deepspeed.runtime.sequence_parallel.ulysses_sp import UlyssesSPAttentionHF
    from deepspeed.utils import groups


Importing DeepSpeedCPUAdam at the top level can cause import-time failures or significant startup delays, especially in environments where DeepSpeed is not installed or where its C++ extensions are not compiled (e.g., local CPU-only development environments).

It is highly recommended to lazily import DeepSpeedCPUAdam only when it is actually needed (i.e., when args.use_cpu_adam is enabled).

Suggested change

with contextlib.suppress(Exception):

import deepspeed

from deepspeed.ops.adam import DeepSpeedCPUAdam

from deepspeed.runtime.sequence_parallel.ulysses_sp import UlyssesSPAttentionHF

from deepspeed.utils import groups

with contextlib.suppress(Exception):

import deepspeed

from deepspeed.runtime.sequence_parallel.ulysses_sp import UlyssesSPAttentionHF

from deepspeed.utils import groups

gemini-code-assist · 2026-06-24T15:32:31Z

+        if args.use_cpu_adam:
+            self.optimizer = DeepSpeedCPUAdam(optim_params, lr=args.learning_rate)
+        else:
+            self.optimizer = torch.optim.AdamW(optim_params, lr=args.learning_rate, fused=args.fused_optimizer)


Import DeepSpeedCPUAdam lazily here to prevent import-time errors and provide a clear, actionable error message if DeepSpeed is not installed or compiled with CPU Adam support.

if args.use_cpu_adam: try: from deepspeed.ops.adam import DeepSpeedCPUAdam except ImportError as e: raise ImportError( "DeepSpeedCPUAdam requires deepspeed to be installed and compiled with CPU Adam support. " "Please ensure deepspeed is installed correctly." ) from e self.optimizer = DeepSpeedCPUAdam(optim_params, lr=args.learning_rate) else: self.optimizer = torch.optim.AdamW(optim_params, lr=args.learning_rate, fused=args.fused_optimizer)

Co-authored-by: Cursor <cursoragent@cursor.com>

Reject incompatible combinations early and warn when CPU Adam is enabled without optimizer offload. Fixes allenai#1031 Co-authored-by: Cursor <cursoragent@cursor.com>

Add use_cpu_adam flag for GRPO fast training.

67cff66

Expose DeepSpeedCPUAdam as an optional optimizer when running grpo_fast with CPU optimizer offload on memory-constrained setups. Fixes allenai#1031. Co-authored-by: Cursor <cursoragent@cursor.com>

gemini-code-assist Bot reviewed Jun 24, 2026

View reviewed changes

Chessing234 and others added 2 commits June 24, 2026 21:03

Update CHANGELOG with PR number

557170b

Co-authored-by: Cursor <cursoragent@cursor.com>

Validate use_cpu_adam and deepspeed offload CLI flags.

157ad07

Reject incompatible combinations early and warn when CPU Adam is enabled without optimizer offload. Fixes allenai#1031 Co-authored-by: Cursor <cursoragent@cursor.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add use_cpu_adam flag for GRPO fast training#1737

Add use_cpu_adam flag for GRPO fast training#1737
Chessing234 wants to merge 3 commits into
allenai:mainfrom
Chessing234:fix/grpo-cpu-adam

Chessing234 commented Jun 24, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 24, 2026

Uh oh!

gemini-code-assist Bot Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

Chessing234 commented Jun 24, 2026

Summary

Test plan

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant