VikingDeng/rl-research-platform

# 🧠 RL Research Platform (Gen 2)

中文版本 (Chinese Version)

An industrial-grade Research & Operations (MLOps) platform tailored for Reinforcement Learning (RL) and Multi-Agent RL (MARL).

Designed for researchers who need reproducible experiments, automated evaluation, and deep observability.



## ✨ Key Features

### 🏋️ Training & Scheduling

- **Hybrid Engine Support:** Native support for Stable-Baselines3 (single-agent) and Ray RLlib (multi-agent).
- **Git-Ops Workflow:** Run experiments directly from your Git commits. The platform records the commit hash of every run, so results can always be traced back to the exact code that produced them.
- **Config Diff:** Instantly visualize hyperparameter differences between any two runs.
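To illustrate what a config diff boils down to, here is a minimal sketch of comparing two runs' hyperparameters. `diff_configs` and the run dicts are illustrative, not the platform's actual API:

```python
# Minimal sketch of a hyperparameter diff between two runs.
# `diff_configs` is a hypothetical helper, not the platform's real API.

def diff_configs(a: dict, b: dict) -> dict:
    """Return {key: (old, new)} for every key whose value differs."""
    keys = a.keys() | b.keys()
    return {
        k: (a.get(k), b.get(k))
        for k in sorted(keys)
        if a.get(k) != b.get(k)
    }

run_a = {"algo": "PPO", "lr": 3e-4, "gamma": 0.99}
run_b = {"algo": "PPO", "lr": 1e-4, "gamma": 0.99, "clip": 0.2}
print(diff_configs(run_a, run_b))
# {'clip': (None, 0.2), 'lr': (0.0003, 0.0001)}
```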

### 👁️ Observability

- **TensorBoard Integration:** Built-in TensorBoard proxy for deep gradient/loss analysis.
- **Smart Video Gallery:** Automatically records and organizes replay videos from training checkpoints.
- **Real-time Metrics:** Live streaming of reward, entropy, and win rate, downsampled for chart performance.
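One common way to downsample a live metric stream is uniform striding: keep every k-th point so the chart stays near a fixed budget. This is a sketch of the general idea, not the platform's actual implementation:

```python
# Stride-based downsampling for live metric streams (illustrative only).

def downsample(points: list, max_points: int) -> list:
    """Thin `points` to roughly `max_points` by uniform striding."""
    if len(points) <= max_points:
        return points
    stride = -(-len(points) // max_points)  # ceiling division
    thinned = points[::stride]
    # Always keep the latest point so the live value shown is exact.
    if thinned[-1] != points[-1]:
        thinned.append(points[-1])
    return thinned

rewards = list(range(1000))
assert len(downsample(rewards, 100)) <= 101
```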

### 🧪 Evaluation & Analysis

- **Matrix Evaluation:** Automated "league table" generation: run-vs-run evaluations with heatmaps and Elo scoring.
- **Repro Bundle:** One-click export of `reproduce.sh`, `config.yaml`, and `README.md` for open-sourcing your results.
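For reference, the standard Elo update that a matrix evaluation could use to rank runs looks like this. The K-factor and initial ratings are illustrative choices, not the platform's defaults:

```python
# Standard Elo rating update (illustrative parameters).

def elo_update(r_a: float, r_b: float, score_a: float, k: float = 32.0):
    """Return updated (r_a, r_b); score_a is 1 for a win, 0.5 draw, 0 loss."""
    expected_a = 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))
    delta = k * (score_a - expected_a)
    return r_a + delta, r_b - delta

# Run A (1200) beats Run B (1200): both ratings move by K/2 = 16.
a, b = elo_update(1200.0, 1200.0, score_a=1.0)
print(a, b)  # 1216.0 1184.0
```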

## 🚀 Quick Start

### Option A: Docker Deployment (Recommended)

**Best for:** Servers with Docker installed. Zero configuration required.

```sh
# 1. Start the platform (builds everything automatically)
docker compose up -d --build

# 2. View logs
docker compose logs -f
```

Access at: `http://localhost:8000`


### Option B: User-Space Deployment (No Docker/Sudo)

**Best for:** Shared HPC clusters or school servers without root access.

**Step 1: Local preparation (on your Mac/PC).** Build the frontend assets locally so you don't have to install Node.js on the server.

```sh
cd rl-research-platform
npm ci && npm run build
# Now upload the entire project (including the new 'dist' folder) to your server.
```

**Step 2: Server launch.**

```sh
# 1. Grant execution permissions
chmod +x start-linux.sh
chmod +x start-mac.sh

# 2. Start the platform (one-click setup + tests)
# Linux:
./start-linux.sh
# macOS:
./start-mac.sh
```

Access at: `http://localhost:8000`


### Option C: Backend-Only Quick Start (Offline-Friendly)

**Best for:** Machines whose Python environment already has the dependencies installed, or network-restricted servers.

```sh
chmod +x scripts/backend-local-up.sh
cp apps/portal-backend/.env.example apps/portal-backend/.env
./scripts/backend-local-up.sh
```

Notes:

- Uses SQLite by default (`apps/portal-backend/rl_platform.db`).
- Auto-detects a usable Python interpreter (checks `BACKEND_PYTHON`, then `.venv`, then a conda env).
- Skips the heavy Orbit/extras runtime installation performed by `start-linux.sh`.

### Option D: One-Click Acceptance Check (For Demo Readiness)

**Best for:** Verifying that the platform can run on a given machine before recording or demoing.

```sh
chmod +x scripts/acceptance-check.sh
./scripts/acceptance-check.sh
```

Checks:

- `docker compose config` validation
- frontend build
- backend startup + `/healthz` smoke test

### What the start scripts do

The `start-*.sh` scripts are fully automated and will:

- Build the frontend and generate OpenAPI clients
- Create a venv and install backend/runner dependencies (or reuse an existing conda env if available)
- Optionally install Miniconda + OrbitZoo + Orekit data (`INSTALL_ORBIT_RUNTIME=1`)
- Optionally install common RL env extras (`INSTALL_RL_EXTRAS=1`)
- Initialize the DB and seed defaults
- Optionally seed comprehensive MARL envs (`SEED_MARL_ENVS=1`)
- Optionally run backend tests (`RUN_TESTS=1`)
- Start TensorBoard + the backend

You can skip the heavy steps if needed:

```sh
SEED_MARL_ENVS=0 RUN_TESTS=0 INSTALL_ORBIT_RUNTIME=0 INSTALL_RL_EXTRAS=0 ./start-linux.sh
```

## 📂 Project Structure

```text
rl-research-platform/
├── apps/
│   ├── portal-backend/       # FastAPI Backend & Orchestrator
│   │   ├── app/              # Core Logic (API, DB, Services)
│   │   └── runner/           # Training Runner (Executes SB3/RLlib)
│   └── portal-frontend/      # React Frontend (Vite)
├── scripts/
│   ├── seed-full.sh          # Database Seeding (Default Envs/Algos)
│   ├── backend-local-up.sh   # Backend quick start (offline-friendly)
│   ├── start-linux.sh        # Unified Startup Script (Linux)
│   └── start-mac.sh          # Unified Startup Script (macOS)
├── docs/                     # Documentation
└── requirements.txt          # Top-level deps
```

## 🔬 Research Workflow

1. **Develop:** Write your custom environment or algorithm wrapper in your local Git repository.
2. **Push:** Commit and push your changes to GitHub/GitLab.
3. **Submit:** In the platform, create a Job pointing to your Git repo URL.
4. **Observe:** Watch live TensorBoard plots and video replays.
5. **Evaluate:** Select your best checkpoints and run a "Matrix Job" to benchmark against baselines.
6. **Publish:** Click "Download Repro Bundle" to get a clean, shippable zip file for your paper.

## 🔧 Extending the Platform

See the Developer Guide for details on:

- Adding custom Gym/PettingZoo environments
- Registering new algorithms
- The plugin system for custom rewards/loggers
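As a taste of what a custom environment involves, here is a dependency-free skeleton following the Gymnasium `reset`/`step` contract. In the real platform you would subclass `gymnasium.Env` and register it per the Developer Guide; the env name, spaces, and reward here are all illustrative:

```python
# Minimal Gymnasium-style environment sketch (no gymnasium dependency).
# Toy task: the agent nudges a scalar position toward a hidden target.
import random

class GuessTheTargetEnv:
    def __init__(self, target: float = 0.7, max_steps: int = 50):
        self.target = target
        self.max_steps = max_steps

    def reset(self, seed=None):
        if seed is not None:
            random.seed(seed)
        self.pos = random.random()
        self.steps = 0
        return self.pos, {}  # observation, info

    def step(self, action: float):
        self.pos = min(1.0, max(0.0, self.pos + action))
        self.steps += 1
        reward = -abs(self.pos - self.target)            # closer is better
        terminated = abs(self.pos - self.target) < 1e-2  # reached the target
        truncated = self.steps >= self.max_steps         # time limit hit
        return self.pos, reward, terminated, truncated, {}

env = GuessTheTargetEnv()
obs, info = env.reset(seed=0)
obs, reward, terminated, truncated, info = env.step(0.1)
```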

🔥 **New:** LLM Integration Guide: how to use GPT-4/Claude to auto-generate code for this platform.


Built for the RL Community.
