Claw-R1 uses verl as its training backend. Follow the steps below to set up your environment.

Prerequisites:
- Conda (recommended for environment management)
- Python 3.10+
- CUDA (for GPU training)
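
Before creating the environment, it can help to confirm that the prerequisites are visible from your shell. The following is a minimal sketch, not part of Claw-R1 or verl: the helper name `check_prerequisites` is hypothetical, and looking for `nvidia-smi` on the PATH is only a rough proxy for a working NVIDIA driver, not proof that a CUDA toolkit is installed.

```python
import shutil
import sys


# Hypothetical helper for illustration; not shipped with Claw-R1 or verl.
def check_prerequisites(min_python=(3, 10)):
    """Return a dict of booleans describing which prerequisites are visible."""
    return {
        # True if a `conda` executable is on the PATH.
        "conda": shutil.which("conda") is not None,
        # True if the interpreter running this script meets the minimum version.
        "python_ok": sys.version_info >= min_python,
        # Assumption: `nvidia-smi` on the PATH suggests an NVIDIA driver is present.
        "nvidia_smi": shutil.which("nvidia-smi") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

Note that `python_ok` describes the interpreter running the script, which may differ from the Python 3.10 interpreter that Conda installs into the new environment.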

Create and activate the Conda environment:

conda create -n clawr1 python=3.10 -y
conda activate clawr1

Install the nightly version of veRL from source (recommended):
git clone https://github.com/volcengine/verl && cd verl
pip install --no-deps -e .
cd ..

After installation, verify that the verl package imports correctly:
python -c "import verl; print('veRL installed successfully')"
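
For a slightly more detailed check than the one-liner above, a short script can report the Python version, whether the package can be located, and whether a GPU is visible. This is a sketch under stated assumptions: `check_environment` is a hypothetical helper, and the CUDA check assumes PyTorch is installed in the environment, which the steps above do not install on their own because `pip install --no-deps` skips dependencies.

```python
import importlib.util
import sys


# Hypothetical helper for illustration; not part of Claw-R1 or verl.
def check_environment(package="verl", min_python=(3, 10)):
    """Report interpreter version, package visibility, and CUDA availability."""
    report = {
        "python_ok": sys.version_info >= min_python,
        # find_spec only locates the package; it does not import it.
        "package_found": importlib.util.find_spec(package) is not None,
        # None means "could not be determined" (for example, PyTorch is absent).
        "cuda_available": None,
    }
    # Assumption: PyTorch is the library used to query CUDA availability.
    if importlib.util.find_spec("torch") is not None:
        try:
            import torch

            report["cuda_available"] = bool(torch.cuda.is_available())
        except Exception:
            # A broken PyTorch install should not crash the check itself.
            report["cuda_available"] = None
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

Run the script inside the activated clawr1 environment. A `package_found` value of `False` usually means the editable install did not complete or a different environment is active.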