
[feat][Draft] Refactor slime to work with custom Megatron and vllm version #7

Open
knlnguyen1802 wants to merge 13 commits into SamitHuang:dev_vllm from knlnguyen1802:refactor_dev

Conversation

@knlnguyen1802
Collaborator

This branch runs with vLLM 0.17.0 and Megatron-LM core 0.16.1.

How to run this
Base image for docker
nvcr.io/nvidia/pytorch:26.01-py3

Download Megatron-LM 0.16.1
https://github.com/NVIDIA/Megatron-LM/releases/tag/core_v0.16.1
Unzip it, rename the folder to Megatron-LM, and place it under /root/

pip install vllm==0.17.0

Clone the slime repo

cd slime
pip install -e .

For compatibility:

pip install "numpy<2"
pip install torch_memory_saver
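The steps above can be collected into one setup script. This is a sketch, not part of the PR: the archive filename pattern, install paths, and placeholder repo URL are assumptions; adjust them to your environment.

```shell
#!/usr/bin/env bash
# Sketch of the setup steps above; intended to run inside the
# nvcr.io/nvidia/pytorch:26.01-py3 container. Paths are assumptions.
set -euo pipefail

# Megatron-LM core_v0.16.1, unpacked and renamed under /root/
cd /root
curl -LO https://github.com/NVIDIA/Megatron-LM/archive/refs/tags/core_v0.16.1.tar.gz
tar -xzf core_v0.16.1.tar.gz
mv Megatron-LM-core_v0.16.1 Megatron-LM

# vLLM at the version this branch targets
pip install vllm==0.17.0

# slime itself (the PR does not give a URL; use your fork)
git clone <your-slime-repo-url> slime
cd slime
pip install -e .

# compatibility pins
pip install "numpy<2"
pip install torch_memory_saver
```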

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request implements a backend separation plan to decouple the framework from a hard dependency on sglang, enabling support for vLLM. Key changes include renaming backend-agnostic arguments like --server-concurrency, introducing lazy imports for sglang-specific modules, and providing local fallbacks for weight-synchronization utilities to support environments without sglang. The PR also updates Megatron-Core parameter gathering to support strided layouts and includes a comprehensive RFC detailing the refactoring strategy. A critical issue was identified in the argument parsing logic where Hugging Face checkpoint validation was inadvertently disabled by hardcoding a skip flag to true.
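The lazy-import pattern the review summary mentions can be sketched roughly as follows. This is an illustrative helper, not the PR's actual code: the function name, cache, and error message are assumptions. The idea is that backend modules such as sglang are imported only on first use, so a vLLM-only environment never hits an ImportError at module load time.

```python
import importlib

def lazy_import(name: str, _cache: dict = {}):
    """Import a backend module on first use and cache it.

    Hypothetical sketch of a lazy-import-with-fallback helper;
    raises a clear RuntimeError if the backend is not installed.
    """
    if name not in _cache:
        try:
            _cache[name] = importlib.import_module(name)
        except ImportError as exc:
            raise RuntimeError(
                f"backend module {name!r} requested but not installed; "
                "install it or select another backend"
            ) from exc
    return _cache[name]
```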

@knlnguyen1802 knlnguyen1802 changed the title [feat] Refactor slime to work with custom Megatron and vllm version [feat][Draft] Refactor slime to work with custom Megatron and vllm version Apr 1, 2026
Owner

@SamitHuang left a comment


clear design

Owner


move it to design doc


  self.semaphore = asyncio.Semaphore(
-     args.sglang_server_concurrency * args.rollout_num_gpus // args.rollout_num_gpus_per_engine
+     args.server_concurrency * args.rollout_num_gpus // args.rollout_num_gpus_per_engine
Owner


it's clearer to name it rollout_server_concurrency?
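For context, the semaphore line under discussion caps total in-flight requests at the per-server concurrency times the number of rollout engines. A minimal sketch of that arithmetic, with an illustrative function name and sample numbers (not from the PR):

```python
import asyncio

def total_concurrency(server_concurrency: int,
                      rollout_num_gpus: int,
                      rollout_num_gpus_per_engine: int) -> int:
    """Total in-flight request cap across all rollout engines.

    Mirrors the expression in the diff above: per-server concurrency
    multiplied by the number of engines (GPUs // GPUs-per-engine).
    """
    num_engines = rollout_num_gpus // rollout_num_gpus_per_engine
    return server_concurrency * num_engines

# e.g. 64 requests per server, 8 GPUs, 4 GPUs per engine -> 2 engines
sem = asyncio.Semaphore(total_concurrency(64, 8, 4))
```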

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>