Scaffold for a modular imitation learning toolkit with dataset loaders, retargeting, and CLI utilities.
We recommend using the conda-forge channel for managing dependencies. For a faster experience, use mamba to create the environment and uv to install packages.
- Create and activate a conda environment:

```bash
mamba create -n iltools python=3.11 -c conda-forge
conda activate iltools
```
- Install uv:

```bash
pip install uv
```
- Install the project. Base install (without Loco-MuJoCo):

```bash
uv pip install -e .
```

If you need Loco-MuJoCo support (dataset loader, tests, viewers), install the optional extra:

```bash
uv pip install -e .[loco-mujoco]
```
Alternatively, with pip:

```bash
pip install -e .[loco-mujoco]
```
Note: Using the `-e` flag installs the project in "editable" mode, which is recommended for development.
The structure of a dataset is as follows:
```
Dataset/
├── motion1/
│   ├── trajectory1/
│   │   ├── observations/
│   │   │   ├── qpos
│   │   │   ├── qvel
│   │   │   └── ...
│   │   ├── actions/
│   │   │   ├── target_joint_pos
│   │   │   └── ...
│   │   ├── rewards
│   │   └── infos
│   ├── trajectory2/
│   │   ├── observations/
│   │   │   ├── qpos
│   │   │   ├── qvel
│   │   │   └── ...
│   │   ├── actions/
│   │   │   ├── target_joint_pos
│   │   │   └── ...
│   │   ├── rewards
│   │   └── infos
│   └── ...
├── motion2/
│   ├── trajectory1/
│   │   ├── observations/
│   │   │   ├── qpos
│   │   │   ├── qvel
│   │   │   └── ...
│   │   ├── actions/
│   │   │   ├── target_joint_pos
│   │   │   └── ...
│   │   ├── rewards
│   │   └── infos
│   └── ...
└── ...
```
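To sanity-check a dataset on disk, you can walk this hierarchy directly with the `zarr` Python API. A minimal sketch, assuming the trajectories are stored as a Zarr group named `trajectories.zarr` (as in the usage example below) and that the group and array names follow the layout above:

```python
import zarr

# Open the dataset root read-only; adjust the path to your store.
root = zarr.open_group("path/to/your/dataset/trajectories.zarr", mode="r")

# Walk motions -> trajectories -> observation arrays.
for motion_name, motion in root.groups():
    for traj_name, traj in motion.groups():
        qpos = traj["observations"]["qpos"]  # Zarr array, e.g. shape (T, nq)
        print(f"{motion_name}/{traj_name}: qpos shape {qpos.shape}")
```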
For example, we currently support the Loco-MuJoCo dataset, with motions such as default-walk (one trajectory each). Install the `loco-mujoco` optional extra to enable this loader.
Here is an example of how to use the `TrajectoryDatasetManager` to load and step through a dataset, inspired by `tests/datasets/test_integration.py`.
```python
import torch
from tensordict import TensorDict
from iltools.datasets.dataset_manager import TrajectoryDatasetManager

# Mock configuration for demonstration
class MockConfig:
    def __init__(self, **kwargs):
        for key, value in kwargs.items():
            setattr(self, key, value)

# 1. Configure the dataset manager.
# Replace 'path/to/your/dataset' with the actual path to your Zarr dataset directory.
# This directory should contain 'trajectories.zarr' and 'metadata.json'.
cfg = MockConfig(
    dataset_path="path/to/your/dataset",
    assignment_strategy="random",  # or 'sequential', 'round_robin', 'curriculum'
    window_size=128,
)

# 2. Initialize the manager for a specified number of environments.
num_envs = 16
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
manager = TrajectoryDatasetManager(cfg, num_envs, device)

# 3. Reset trajectories at the beginning of a training session.
manager.reset_trajectories()

# 4. Fetch reference data in a loop (e.g., inside your RL environment's step function).
for step in range(1000):  # Simulate 1000 steps
    # Get a batch of reference data for the current timestep.
    # The data is returned as a TensorDict with batch shape [num_envs, ...].
    reference_data = manager.get_reference_data()
    com_positions = reference_data["com_pos"]
    joint_positions = reference_data["joint_pos"]
    # Your agent would use this data to compute actions...
    # print(f"Step {step}: COM position batch shape: {com_positions.shape}")

# To reset specific environments that have completed their episode:
# done_env_ids = torch.tensor([2, 5], device=device)  # Example IDs
# manager.reset_trajectories(done_env_ids)
```

To verify that the Loco-MuJoCo dataset loader is working correctly, you can run its dedicated test file. This is currently the recommended test to run.
First, ensure you have the necessary test dependencies installed (including the optional extra):
```bash
uv pip install pytest numpy torch omegaconf zarr mujoco
uv pip install -e .[loco-mujoco]
```

Then, run the following command from the root of the project:
```bash
pytest tests/datasets/test_loco_mujoco_loader.py
```

To visually inspect the loaded trajectories, you can use the `--visualize` flag. This opens a MuJoCo viewer and replays the first trajectory.
```bash
pytest tests/datasets/test_loco_mujoco_loader.py --visualize
```
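If you adapt these tests, note that `--visualize` is a custom pytest flag. Flags like this are registered in a `conftest.py` via `pytest_addoption`; the sketch below shows one typical way to wire it up (illustrative only; the repository's actual `conftest.py` may differ):

```python
# tests/conftest.py (illustrative sketch; not necessarily the repo's actual wiring)
import pytest

def pytest_addoption(parser):
    # Register the custom --visualize flag; off by default so CI stays headless.
    parser.addoption(
        "--visualize",
        action="store_true",
        default=False,
        help="Replay loaded trajectories in a MuJoCo viewer",
    )

@pytest.fixture
def visualize(request):
    # Tests request this fixture to decide whether to open the viewer.
    return request.config.getoption("--visualize")
```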