GitHub - Nawfal-AI/Turret_replication

# TURRET: Transferable Unified Robot Representation with Graph Neural Networks

> **TURRET**: A Graph Neural Network framework for adaptive multi-source cross-domain transfer learning in robotic control.

## 📖 Overview

TURRET is a novel framework that enables effective knowledge transfer across different robot morphologies and task domains using Graph Neural Networks (GNNs). This implementation reproduces the core contributions of the original paper:

- **Unified Semantic Space**: Projects states from different tasks into a common embedding space
- **Adaptive Transfer**: Dynamically weights source policies based on state-level semantic similarity  
- **GNN-based Policy**: Structured policy networks that explicitly model robot morphology
- **Gradual Independence**: Progressive transition from transfer learning to independent learning

## 🎯 Key Features

- 🕸️ **Morphology-aware GNNs**: Explicitly model robot structure as graphs
- 🔄 **Multi-source Transfer**: Combine knowledge from multiple source policies
- 🎯 **State-level Adaptation**: Dynamic transfer weights based on current state
- 📈 **Progressive Learning**: Smooth transition from transfer to independent learning
- 🧪 **Comprehensive Evaluation**: Size transfer, morphology transfer, and ablation studies

## 🚀 Quick Start

### Installation & Setup

```bash
# Clone repository
git clone https://github.com/your-username/turret-replication.git
cd turret-replication

# Create environment (recommended)
conda create -n turret python=3.8
conda activate turret

# Install dependencies
pip install -r requirements.txt

# Download pre-trained models
python scripts/download_pretrained.py

# Or pre-train source policies yourself
python experiments/pretrain_source.py
```

### Run Experiments

```bash
# Quick demo (2 seeds, 100 episodes)
python scripts/run_all_experiments.py --num_seeds 2 --total_episodes 100

# Full paper replication (5 seeds, 500 episodes)
python scripts/run_all_experiments.py --num_seeds 5 --total_episodes 500

# Run specific experiments only
python scripts/run_all_experiments.py --experiments size morphology

# With GPU acceleration
python scripts/run_all_experiments.py --device cuda
```

### Basic Usage

```python
from configs.base_config import TURRETConfig
from experiments.paper_experiments import PaperExperimentReplicator

# Configure experiment
config = TURRETConfig(
    device="cuda",
    total_episodes=500,
    num_seeds=5
)

# Run all paper experiments
replicator = PaperExperimentReplicator(config)
results = replicator.run_all_experiments()
```

## 🔬 Experiment Guide

### Running Transfer Experiments

```bash
# Run transfer experiments directly
python experiments/transfer_experiment.py

# Run specific experiment types
python scripts/run_all_experiments.py --experiments size             # Only size transfer
python scripts/run_all_experiments.py --experiments size morphology  # Size and morphology
```

### Result Analysis

#### View Experiment Results

```bash
# Results are saved in:
ls data/paper_results/

# Main files:
# - paper_experiments_YYYYMMDD_HHMMSS.json       # Raw results
# - training_statistics_YYYYMMDD_HHMMSS.json     # Training statistics
# - analysis/comprehensive_analysis_report.json  # Analysis report
```
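Because each run writes a new timestamped file, it can help to pick up the most recent one programmatically. The following is a minimal sketch that relies only on the `paper_experiments_YYYYMMDD_HHMMSS.json` naming shown above; the helper names `latest_results_file` and `load_latest_results` are hypothetical and not part of the repository.

```python
import json
import re
from pathlib import Path
from typing import Any, Optional

# File-name pattern taken from the listing above; the helpers are illustrative.
_STAMP = re.compile(r"^paper_experiments_(\d{8}_\d{6})\.json$")


def latest_results_file(results_dir: str) -> Optional[Path]:
    """Return the newest paper_experiments_*.json in results_dir, or None."""
    candidates = []
    for path in Path(results_dir).glob("paper_experiments_*.json"):
        match = _STAMP.match(path.name)
        if match:
            # A YYYYMMDD_HHMMSS stamp sorts chronologically as a plain string.
            candidates.append((match.group(1), path))
    if not candidates:
        return None
    return max(candidates, key=lambda item: item[0])[1]


def load_latest_results(results_dir: str = "data/paper_results") -> Optional[Any]:
    """Parse the newest raw results file, or return None if there is none."""
    path = latest_results_file(results_dir)
    if path is None:
        return None
    with path.open() as handle:
        return json.load(handle)
```

Sorting on the timestamp in the file name rather than on file modification time keeps the choice stable when results are copied between machines.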

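#### Generate Analysis Reports

```python
from analysis.result_analyzer import ResultAnalyzer

# `results` is the value returned by replicator.run_all_experiments() (see Basic Usage)
analyzer = ResultAnalyzer("data/paper_results")
analysis = analyzer.generate_comprehensive_analysis(results)
```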

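#### Visualize Results

```python
from experiments.visualization.advanced_visualizer import AdvancedVisualizer

# `results` is the value returned by replicator.run_all_experiments() (see Basic Usage)
visualizer = AdvancedVisualizer()
visualizer.plot_transfer_dynamics(results)
```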

### Performance Optimization

#### GPU Acceleration

```bash
# Run with GPU
python scripts/run_all_experiments.py --device cuda

# Run with multiple GPUs
python scripts/run_all_experiments.py --device cuda --num_processes 4
```

#### Distributed Training

```python
from optimization.distributed_trainer import DistributedTURRETTrainer

# `config` is a TURRETConfig instance (see Basic Usage)
dist_trainer = DistributedTURRETTrainer(config)
```

#### Performance Analysis

```python
from optimization.performance_optimizer import PerformanceOptimizer

# `model`, `node_observations`, and `morphology_graph` must already exist in scope
optimizer = PerformanceOptimizer(config)
stats = optimizer.optimize_gnn_forward(model, node_observations, morphology_graph)
```

## 🏗️ Architecture

### Core Components

```
TURRET/
├── configs/
│   ├── base_config.py              # Main configuration dataclass (TURRETConfig)
│   └── environment_config.py       # Environment-specific settings
├── models/
│   ├── policies/
│   │   ├── gnn_structured_policy.py      # Full GNN policy (production version)
│   │   └── structured_policy.py          # Simplified policy (testing version)
│   ├── networks/
│   │   ├── attention_propagation.py      # Multi-head GNN layers
│   │   ├── set_transformer.py           # State embedding via attention
│   │   ├── input_network.py             # Node observation processing
│   │   ├── output_network.py            # Action distribution prediction
│   │   └── base_networks.py             # Base neural network components
│   ├── morphology.py               # Robot graph structure definitions
│   └── components/
│       └── distributions.py        # Probability distributions for actions
├── training/
│   ├── trainers/
│   │   ├── transfer_trainer.py           # Complete TURRET training system
│   │   ├── ppo_trainer.py               # Base PPO implementation
│   │   └── base_trainer.py              # Abstract trainer interface
│   ├── buffers.py                  # Experience replay buffers
│   └── optimizers.py               # Gradient management and schedulers
├── environments/
│   ├── tasks/
│   │   ├── centipede.py                 # Centipede-n multi-legged robots
│   │   └── standard_robots.py           # MuJoCo standard robots
│   ├── base_env.py               # Abstract environment interface
│   └── mujoco_wrapper.py         # MuJoCo environment wrapper
├── transfer/
│   ├── semantic_space.py         # Unified state embedding space
│   ├── weight_calculator.py      # Adaptive transfer weight computation
│   ├── lateral_connections.py    # Knowledge fusion mechanisms
│   ├── independence.py           # Gradual independence scheduler
│   └── base_transfer.py          # Base class for transfer components
├── experiments/
│   ├── runners/
│   │   ├── size_transfer.py             # Size transfer experiments
│   │   ├── morphology_transfer.py       # Morphology transfer experiments
│   │   └── base_runner.py               # Experiment runner base class
│   ├── evaluation/
│   │   ├── evaluator.py                 # Experiment evaluation
│   │   ├── metrics.py                   # Performance metrics
│   │   └── baseline_models.py           # Baseline method implementations
│   ├── visualization/
│   │   ├── advanced_visualizer.py       # Interactive visualizations
│   │   ├── tsne_visualizer.py          # Dimensionality reduction
│   │   └── trajectory_plot.py           # Training trajectory plotting
│   ├── paper_experiments.py      # Unified experiment entry point
│   ├── transfer_experiment.py    # Transfer learning experiments
│   └── pretrain_source.py        # Source policy pre-training
├── analysis/
│   └── result_analyzer.py        # Comprehensive result analysis
├── optimization/
│   ├── performance_optimizer.py  # Performance optimization tools
│   └── distributed_trainer.py    # Distributed training support
├── scripts/
│   ├── run_all_experiments.py    # Main experiment runner
│   └── download_pretrained.py    # Pre-trained model downloader
└── utils/
    ├── file_utils.py             # Checkpoint and file management
    └── logging_utils.py          # Logging and training statistics
```

### Key Algorithms

#### Graph-based Policy Representation

```python
# Robot morphology as graph
morphology_graph = MorphologyGraph("Humanoid")
policy = GNNStructuredPolicyNetwork(config)
```
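To make "morphology as a graph" concrete, here is a minimal sketch in plain Python of one message-passing round over a toy robot skeleton. The joint names, the edge list, and the mean aggregation are assumptions chosen for illustration: the repository's `MorphologyGraph` and its attention-based propagation layers are not reproduced here, and simple averaging is swapped in for attention.

```python
from typing import Dict, List, Tuple

# Hypothetical toy morphology: a torso with two legs, each leg a two-joint chain.
EDGES: List[Tuple[str, str]] = [
    ("torso", "hip_l"), ("hip_l", "knee_l"),
    ("torso", "hip_r"), ("hip_r", "knee_r"),
]


def build_adjacency(edges: List[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Build an undirected adjacency list from (parent, child) joint pairs."""
    adjacency: Dict[str, List[str]] = {}
    for a, b in edges:
        adjacency.setdefault(a, []).append(b)
        adjacency.setdefault(b, []).append(a)
    return adjacency


def propagate(features: Dict[str, List[float]],
              adjacency: Dict[str, List[str]]) -> Dict[str, List[float]]:
    """One round: each node averages its own features with its neighbours'."""
    updated = {}
    for node, own in features.items():
        group = [own] + [features[n] for n in adjacency[node]]
        updated[node] = [sum(column) / len(group) for column in zip(*group)]
    return updated


adjacency = build_adjacency(EDGES)
features = {name: [0.0] for name in adjacency}
features["knee_l"] = [1.0]  # a signal observed only at the left knee
after_one = propagate(features, adjacency)
after_two = propagate(after_one, adjacency)
```

After one round the signal has reached only the left hip; it takes a second round to reach the torso. This is why the number of propagation steps matters for robots with long limb chains.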

#### Adaptive Transfer Weights

```python
# Compute transfer weights based on state similarity
weights = weight_calculator.compute_transfer_weights(
    target_state, source_states
)
```
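One plausible way to turn state similarity into weights is a softmax over negative distances in the shared embedding space, so that source policies whose embedded state lies closer to the target state receive more weight. The sketch below assumes Euclidean distance and a `temperature` parameter. Both are illustrative choices, and the function `transfer_weights` is hypothetical, not the repository's `compute_transfer_weights`.

```python
import math
from typing import List, Sequence


def transfer_weights(target: Sequence[float],
                     sources: List[Sequence[float]],
                     temperature: float = 1.0) -> List[float]:
    """Softmax over negative Euclidean distances; closer sources weigh more."""
    if not sources:
        raise ValueError("at least one source embedding is required")
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scores = [-math.dist(target, s) / temperature for s in sources]
    top = max(scores)  # subtract the max before exponentiating for stability
    exps = [math.exp(score - top) for score in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Two hypothetical source embeddings: one near the target, one far away.
weights = transfer_weights([0.0, 0.0], [[0.1, 0.0], [3.0, 4.0]])
```

The weights always sum to one. A lower `temperature` concentrates them on the nearest source, while a higher one spreads them more evenly.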

#### Gradual Independence

```python
# Progressive independence factor
p = independence_scheduler.get_current_p()
fused_output = p * target + (1-p) * transferred
```
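The fusion rule above is a convex combination: at `p = 0` the output is entirely the transferred signal, and at `p = 1` it is entirely the target policy's own output. The sketch below assumes a linear ramp for `p`. The class `LinearIndependenceSchedule`, its `warmup_steps` parameter, and the `fuse` helper are hypothetical, and the schedule used in `transfer/independence.py` may have a different shape.

```python
from typing import List, Sequence


class LinearIndependenceSchedule:
    """p rises linearly from 0 to 1 over `warmup_steps`, then stays at 1."""

    def __init__(self, warmup_steps: int) -> None:
        if warmup_steps <= 0:
            raise ValueError("warmup_steps must be positive")
        self.warmup_steps = warmup_steps
        self.step_count = 0

    def step(self) -> None:
        """Advance the schedule by one training step."""
        self.step_count += 1

    def get_current_p(self) -> float:
        return min(1.0, self.step_count / self.warmup_steps)


def fuse(p: float, target: Sequence[float], transferred: Sequence[float]) -> List[float]:
    """Element-wise p * target + (1 - p) * transferred."""
    return [p * t + (1.0 - p) * s for t, s in zip(target, transferred)]


schedule = LinearIndependenceSchedule(warmup_steps=4)
history = []
for _ in range(6):
    # With target = 1 and transferred = 0 the fused value equals p itself.
    history.append(fuse(schedule.get_current_p(), [1.0], [0.0])[0])
    schedule.step()
```

With `warmup_steps=4` the fused value moves from the transferred output to the target output in four equal steps and then stays there.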

## 📊 Experiments

### Supported Transfer Scenarios

| Experiment Type | Source Tasks | Target Tasks | Description |
|---|---|---|---|
| Size Transfer | HalfCheetah, Ant | Humanoid, Walker2d | Small→Large robot transfer |
| Morphology Transfer | Quadruped→Biped | Various combinations | Cross-morphology transfer |
| Ablation Studies | - | - | Component importance analysis |
| Baseline Comparison | PPO, CAT, NerveNet | Standard tasks | Method performance comparison |

### Evaluation Metrics

- **Performance**: Mean reward, learning speed, sample efficiency
- **Transfer Effectiveness**: Weight distributions, semantic distances
- **Statistical Significance**: Confidence intervals, effect sizes
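For the statistical-significance metrics, a minimal sketch using only the standard library is given below. It computes a normal-approximation confidence interval for the mean reward across seeds and Cohen's d between two methods. The function names are hypothetical and unrelated to `experiments/evaluation/metrics.py`. With only a handful of seeds the normal approximation is optimistic, and a Student-t interval would be wider.

```python
import math
import statistics
from typing import Sequence, Tuple


def mean_ci(rewards: Sequence[float], z: float = 1.96) -> Tuple[float, float, float]:
    """Return (mean, lower, upper); z = 1.96 gives roughly a 95% interval."""
    if len(rewards) < 2:
        raise ValueError("need at least two samples")
    mean = statistics.mean(rewards)
    half_width = z * statistics.stdev(rewards) / math.sqrt(len(rewards))
    return mean, mean - half_width, mean + half_width


def cohens_d(a: Sequence[float], b: Sequence[float]) -> float:
    """Effect size: difference of means over the pooled standard deviation."""
    na, nb = len(a), len(b)
    if na < 2 or nb < 2:
        raise ValueError("need at least two samples per group")
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / (na + nb - 2)
    return (statistics.mean(a) - statistics.mean(b)) / math.sqrt(pooled_var)
```

In use, `mean_ci` would take the per-seed final rewards of one method, and `cohens_d` the per-seed rewards of two methods being compared.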

## 📈 Results

### Performance Comparison

### Key Findings

- **Effective Cross-Domain Transfer**: TURRET successfully transfers knowledge across different robot morphologies
- **Adaptive Weighting**: State-level similarity metrics outperform fixed weighting schemes
- **Scalability**: GNN-based policies scale effectively to complex robot structures
- **Progressive Learning**: Gradual independence prevents negative transfer and improves final performance

## 🛠️ Development

### Adding New Experiments

1. Create a new runner in `experiments/runners/`
2. Register the experiment in `paper_experiments.py`
3. Update the configuration classes and run scripts
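The create-and-register steps above can be sketched as a name-to-function registry. This shows one common pattern, not the repository's actual mechanism: `EXPERIMENT_RUNNERS`, `register_experiment`, `run_selected`, and the placeholder runner bodies are all hypothetical. Only the experiment names `size` and `morphology` come from the command-line examples in this README.

```python
from typing import Callable, Dict, List

# Hypothetical registry; the real wiring in paper_experiments.py may differ.
EXPERIMENT_RUNNERS: Dict[str, Callable[[dict], dict]] = {}


def register_experiment(name: str):
    """Decorator that records a runner function under an experiment name."""
    def decorator(fn: Callable[[dict], dict]) -> Callable[[dict], dict]:
        if name in EXPERIMENT_RUNNERS:
            raise ValueError(f"experiment {name!r} is already registered")
        EXPERIMENT_RUNNERS[name] = fn
        return fn
    return decorator


@register_experiment("size")
def run_size_transfer(config: dict) -> dict:
    # Placeholder body: a real runner would train and evaluate here.
    return {"experiment": "size", "episodes": config["total_episodes"]}


@register_experiment("morphology")
def run_morphology_transfer(config: dict) -> dict:
    # Placeholder body, as above.
    return {"experiment": "morphology", "episodes": config["total_episodes"]}


def run_selected(names: List[str], config: dict) -> Dict[str, dict]:
    """Run the named experiments, rejecting unknown names before any work starts."""
    unknown = [n for n in names if n not in EXPERIMENT_RUNNERS]
    if unknown:
        raise KeyError(f"unknown experiments: {unknown}")
    return {n: EXPERIMENT_RUNNERS[n](config) for n in names}
```

With this layout, adding an experiment means writing one decorated function, and a selector such as `--experiments size morphology` maps directly onto `run_selected`.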

### Extending Components

1. Add new components in the appropriate modules
2. Ensure compatibility with `TURRETConfig` for configuration
3. Update import paths and dependencies

### Testing & Validation

The project includes comprehensive testing tools:

```bash
# Full system verification
python verify_phase8.py

# Component interface testing
python tests/test_component_interfaces.py

# Experiment replication testing
python tests/test_experiment_replication.py

# Performance benchmarking
python tests/performance_benchmark.py

# Final validation
python verification/final_validation.py

# Paper experiment test
python tests/test_run_paper_experiments.py
```

### Code Structure

The project follows a modular architecture:

- **Config-driven**: All experiments configured via the `TURRETConfig` dataclass
- **Modular components**: Easy to extend or replace components
- **Comprehensive testing**: Each phase has verification scripts
- **Type hints**: Full type annotation for better development experience
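As an illustration of the config-driven style, the sketch below maps the command-line flags used throughout this README (`--device`, `--total_episodes`, `--num_seeds`, `--experiments`) onto a dataclass. `ExperimentConfig` and `parse_config` are hypothetical stand-ins, and the default values are assumptions. The real `TURRETConfig` in `configs/base_config.py` may define other fields and defaults.

```python
import argparse
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ExperimentConfig:
    """Illustrative stand-in for a config dataclass; defaults are assumed."""
    device: str = "cpu"
    total_episodes: int = 500
    num_seeds: int = 5
    experiments: List[str] = field(default_factory=lambda: ["size", "morphology"])


def parse_config(argv: Optional[List[str]] = None) -> ExperimentConfig:
    """Build a config from command-line style arguments."""
    defaults = ExperimentConfig()
    parser = argparse.ArgumentParser(description="Run transfer experiments")
    parser.add_argument("--device", default=defaults.device)
    parser.add_argument("--total_episodes", type=int, default=defaults.total_episodes)
    parser.add_argument("--num_seeds", type=int, default=defaults.num_seeds)
    parser.add_argument("--experiments", nargs="+", default=defaults.experiments)
    args = parser.parse_args(argv)
    # Argument names match the dataclass fields, so vars() unpacks directly.
    return ExperimentConfig(**vars(args))
```

Taking the parser defaults from the dataclass keeps a single source of truth, so a field's default only has to change in one place.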

## 📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

## 🙏 Acknowledgments

- Original TURRET paper authors for the innovative research
- MuJoCo team for the physics simulation environment
- PyTorch team for the deep learning framework
- HuggingFace for pre-trained model hosting

**Note**: This is a replication project for research purposes. Performance may vary based on hardware and specific experimental setup.
