An imitation learning system for the HiWonder ArmPi robot arm that collects human demonstrations, trains CNN+MLP policies, and deploys them for autonomous control.
This project enables a robot arm to learn manipulation tasks from human demonstrations:
- Collect - A human operator controls the robot arm via keyboard while camera observations and joint states are recorded
- Convert - Raw ROS bag recordings are converted to HDF5 datasets for training
- Train - CNN+MLP or Diffusion Policy models are trained on the demonstration data
- Deploy - Trained models run as a ROS service, providing real-time action inference for autonomous control
| Category | Technologies |
|---|---|
| Robotics | ROS 1 (Noetic), Inverse Kinematics |
| Languages | C++17 (control), Python 3.10 (ML/inference) |
| ML | PyTorch 2.1, TorchVision, Diffusion Policy |
| Data | HDF5, Pandas, OpenCV |
| Infrastructure | Docker (GPU), Conda, SDL2 |
```
ros/
  armpi/                    # Low-level robot control
    armpi_servo/            # Servo drivers & IK action server
    armpi_control/          # Main control node
    armpi_chassis/          # Chassis (mobile base) control
  myapp/
    armpi_controller/       # Controller abstraction (keyboard / AI mode)
    collect_data/           # Data collection from human demonstrations
    ai_model_service/       # ML inference ROS service
  share/
    armpi_operation_msgs/   # Custom ROS message definitions
scripts/
  convert/                  # ROS bag to HDF5 conversion
  create_video.py           # Generate videos from collected data
  docker_run.sh             # Launch Docker development container
datasets/                   # Collected demonstration data (not tracked)
models/                     # Trained model checkpoints (not tracked)
```
- NVIDIA GPU with CUDA support
- Docker with NVIDIA Container Toolkit
- HiWonder ArmPi robot on the same network (default: 192.168.149.1)
The Docker container provides ROS Noetic, PyTorch, and all dependencies pre-installed.
```bash
# Build the Docker image
docker build -t armpi_env .

# Launch the container (mounts ros/myapp, ros/share, datasets, models)
./scripts/docker_run.sh
```

The container runs with `--gpus all` and `--net=host` for GPU access and ROS networking.
```bash
# Build the ROS workspace
cd ~/ros_ws
catkin_make
source devel/setup.bash
```

```bash
conda env create -f environment.yml
conda activate armpi_env
```

```bash
roslaunch armpi start_armpi_control.launch
```

Launch the controller in keyboard mode and record ROS bags of human demonstrations.
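For a rough picture of what keyboard teleoperation involves, the sketch below maps keys to small end-effector deltas. The bindings, step sizes, and command shape here are illustrative assumptions, not the actual ones used by armpi_controller.

```python
# Hypothetical key bindings: (dx, dy, dz) deltas in metres per key press.
# The real keyboard mode in armpi_controller may use different keys/units.
KEY_BINDINGS = {
    "w": (0.01, 0.0, 0.0),   # forward
    "s": (-0.01, 0.0, 0.0),  # backward
    "a": (0.0, 0.01, 0.0),   # left
    "d": (0.0, -0.01, 0.0),  # right
    "q": (0.0, 0.0, 0.01),   # up
    "e": (0.0, 0.0, -0.01),  # down
}

def key_to_command(key, position):
    """Return the new target position after applying the key's delta.

    Unknown keys leave the position unchanged.
    """
    dx, dy, dz = KEY_BINDINGS.get(key, (0.0, 0.0, 0.0))
    x, y, z = position
    return (x + dx, y + dy, z + dz)
```

Each resulting target would then be sent to the control node, which resolves it to joint angles through the IK action server.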
Convert recorded ROS bags to HDF5 format for training:
```bash
conda activate armpi_env
python scripts/convert/main.py
```

Training is handled in the separate IL (Imitation Learning) repository. See Related Repositories.
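One detail any bag-to-HDF5 conversion has to handle is that camera images and joint states arrive at different rates, so each frame must be paired with the nearest joint-state message in time. A minimal sketch of that alignment, using a hypothetical helper rather than the actual convert script:

```python
import bisect

def nearest_joint_state(image_stamp, joint_stamps):
    """Return the index of the joint-state timestamp closest to image_stamp.

    joint_stamps must be sorted ascending (ROS bags are read in time order).
    """
    i = bisect.bisect_left(joint_stamps, image_stamp)
    if i == 0:
        return 0
    if i == len(joint_stamps):
        return len(joint_stamps) - 1
    # Pick whichever neighbour is closer in time.
    before, after = joint_stamps[i - 1], joint_stamps[i]
    return i if after - image_stamp < image_stamp - before else i - 1

# Example: align camera frames (~30 Hz) with joint states (~100 Hz).
joints = [0.00, 0.01, 0.02, 0.03, 0.04]
print(nearest_joint_state(0.033, joints))  # → 3
```

The aligned (image, joint-state) pairs can then be written out episode by episode as HDF5 datasets.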
```bash
# Inside Docker container
roslaunch myapp run_ai_controller.launch model_name:=<your_model>
```

This launches the inference server and controller in AI mode.
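Policy services of this kind typically normalise joint observations with dataset statistics before inference and de-normalise the predicted actions afterwards. A minimal, self-contained sketch of that pattern — the statistics and function names are illustrative, not taken from imitation_service_server:

```python
# Illustrative normalisation wrapper around model inference; the actual
# service's statistics and interfaces may differ.

def normalize(values, mean, std):
    """Scale raw joint readings to zero-mean, unit-variance model inputs."""
    return [(v - m) / s for v, m, s in zip(values, mean, std)]

def denormalize(values, mean, std):
    """Map model outputs back to raw joint commands."""
    return [v * s + m for v, m, s in zip(values, mean, std)]

# Hypothetical per-joint statistics computed once from the demonstrations.
MEAN = [0.5, -0.25, 1.0]
STD = [0.25, 0.5, 0.5]

obs = normalize([0.75, 0.25, 0.5], MEAN, STD)       # model input
action = denormalize([1.0, 1.0, -1.0], MEAN, STD)   # model output → command
```

Keeping the statistics alongside the checkpoint ensures the deployed policy sees inputs on the same scale it was trained on.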
- Trained on 30 human demonstrations of an object grasping task
- Implemented and compared three algorithms: an MLP baseline, ACT, and Diffusion Policy
- The trained policy successfully performs autonomous grasping, as shown in the demos below
armpi_inference.mp4
armpi_teleop.mp4
```mermaid
graph LR
    subgraph Sensor
        CAM[/usb_cam/]
        JS[/joint_states/]
    end
    subgraph Controller ["armpi_controller (generic_robot_controller)"]
        KB[Keyboard Mode]
        AI[AI Mode]
    end
    subgraph Inference ["ai_model_service (imitation_service_server)"]
        MODEL[CNN+MLP / Diffusion Policy]
    end
    subgraph Hardware Control
        CTRL["armpi_control (armpi_control_main)"]
        IK["armpi_servo (ik_action_server)"]
        CHASSIS[armpi_chassis]
    end
    subgraph Data Collection ["collect_data"]
        COLLECT[CollectData Node]
        BAG[(ROS Bag)]
    end
    CAM -- "/usb_cam/image_raw" --> AI
    JS -- "/joint_states" --> AI
    AI -- "predict_action srv" --> MODEL
    MODEL -- "RobotCommand" --> AI
    KB -- "armpi_command" --> CTRL
    AI -- "armpi_command" --> CTRL
    CTRL -- "compute_arm_ik_and_move srv" --> IK
    CTRL -- "set_velocity" --> CHASSIS
    IK -- "multi_id_pos_dur" --> SERVO[Servos]
    CAM -- "/usb_cam/image_raw" --> COLLECT
    JS -- "/joint_states" --> COLLECT
    CTRL -- "get_command" --> COLLECT
    COLLECT --> BAG
```
```mermaid
flowchart LR
    subgraph Collect ["1. Collect"]
        HUMAN[Human Operator] --> |Keyboard Control| CONTROLLER[armpi_controller]
        CAMERA[Camera] --> RECORD[collect_data]
        JOINTS[Joint States] --> RECORD
        CONTROLLER --> |get_command| RECORD
        RECORD --> ROSBAG[(ROS Bag)]
    end
    subgraph Convert ["2. Convert"]
        ROSBAG --> SCRIPT["convert_bag_to_h5.py"]
        SCRIPT --> HDF5[(HDF5 Dataset)]
    end
    subgraph Train ["3. Train"]
        HDF5 --> TRAINING["Model Training<br/>(IL repository)"]
        TRAINING --> CKPT[(Model Checkpoint)]
    end
    subgraph Deploy ["4. Deploy"]
        CKPT --> SERVER["imitation_service_server"]
        SERVER --> |predict_action| AICTRL[AI Controller]
        AICTRL --> |armpi_command| ROBOT[Robot Arm]
    end
```
- IL (Imitation Learning) - Model training code, network architectures, and training pipelines. See this repository for details on CNN+MLP and Diffusion Policy implementations.
This project is licensed under the MIT License. See the LICENSE file for details.