Gymnasium HardMaze

A maze navigation simulator for reinforcement learning research, compatible with the Gymnasium API.

Overview

Gymnasium HardMaze is a reimplementation of the 'hardmaze' environment used in several neuroevolution research papers. It provides a platform for training and evaluating reinforcement learning agents in navigation tasks of varying complexity.

Key features:

XML-based maze configuration
Fully compatible with Gymnasium API
Deterministic when seeded for reproducible research
Visualizations for debugging and demonstrations
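
The determinism claim can be checked empirically by running two rollouts with the same seed and comparing the trajectories. The sketch below is not part of the package: `make_hardmaze`, `rollout`, and `same_trajectory` are hypothetical helper names, and the code assumes only the standard Gymnasium calls `reset(seed=...)`, `step`, `close`, and `action_space.seed`.

```python
from typing import Any, Callable, List, Tuple


def make_hardmaze() -> Any:
    """Build HardMaze-v0. Imports are lazy so the helpers below can be
    reused with any Gymnasium-style environment factory."""
    import gymnasium as gym
    import gymnasium_hardmaze  # noqa: F401  (assumed to register HardMaze-v0 on import)

    return gym.make("HardMaze-v0")


def rollout(
    make_env: Callable[[], Any], seed: int, steps: int = 50
) -> List[Tuple[list, float]]:
    """Run a seeded episode fragment and record (observation, reward) pairs."""
    env = make_env()
    try:
        observation, _info = env.reset(seed=seed)
        # Seed the action sampler as well; otherwise the random actions
        # differ between runs even if the environment is deterministic.
        env.action_space.seed(seed)
        trajectory = []
        for _ in range(steps):
            action = env.action_space.sample()
            observation, reward, terminated, truncated, _info = env.step(action)
            trajectory.append((list(observation), float(reward)))
            if terminated or truncated:
                break
        return trajectory
    finally:
        env.close()


def same_trajectory(make_env: Callable[[], Any], seed: int, steps: int = 50) -> bool:
    """Return True if two rollouts with the same seed match step for step."""
    return rollout(make_env, seed, steps) == rollout(make_env, seed, steps)
```

With the package installed, `same_trajectory(make_hardmaze, seed=42)` would be expected to return `True` if the environment is deterministic under seeding.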

Installation

# Basic installation
pip install gymnasium-hardmaze

# For development
git clone https://github.com/Teaspoon-AI/gymnasium-hardmaze.git
cd gymnasium-hardmaze
pip install -e .

Usage

Basic Example

import gymnasium as gym
import gymnasium_hardmaze  # noqa: F401  (imported so that HardMaze-v0 is registered)

# Create default maze environment
env = gym.make("HardMaze-v0", render_mode="human")

# Reset the environment
observation, info = env.reset(seed=42)

for _ in range(1000):
    # Take a random action
    action = env.action_space.sample()

    # Step the environment
    observation, reward, terminated, truncated, info = env.step(action)

    # Check if episode is done
    if terminated or truncated:
        observation, info = env.reset()

env.close()

Keyboard Control Example

The package includes a keyboard control script for testing environments:

# Run keyboard agent in the hard maze environment
python -m gymnasium_hardmaze.examples.keyboard_agent

Available Environments

Environment ID	Description
`HardMaze-v0`	Complex maze with walls and multiple points of interest

Observation Space

The observation space is a vector containing:

Normalized rangefinder readings (distance to walls)
Radar readings (binary detection of goal)
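
For controllers that treat the two sensor groups differently, it can help to split the flat vector. The helper below is a minimal sketch, not part of the package: `split_observation` is a hypothetical name, and it assumes the rangefinder readings come first and the radar readings follow, in the order listed above. The number of rangefinders is not stated here, so it is passed in explicitly; check `env.observation_space.shape` and the source before relying on a particular layout.

```python
from typing import List, Sequence, Tuple


def split_observation(
    observation: Sequence[float], n_rangefinders: int
) -> Tuple[List[float], List[bool]]:
    """Split a flat observation into (rangefinder distances, radar flags).

    ASSUMPTION: rangefinder readings come first, radar readings after them.
    """
    if not 0 <= n_rangefinders <= len(observation):
        raise ValueError("n_rangefinders must be between 0 and len(observation)")
    rangefinders = [float(v) for v in observation[:n_rangefinders]]
    # Radar readings are described as binary, so threshold them at 0.5.
    radar = [float(v) >= 0.5 for v in observation[n_rangefinders:]]
    return rangefinders, radar


# Example with made-up numbers: 3 rangefinders followed by 2 radar readings.
rangefinders, radar = split_observation([0.2, 1.0, 0.75, 0.0, 1.0], n_rangefinders=3)
print(rangefinders)  # [0.2, 1.0, 0.75]
print(radar)         # [False, True]
```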

Action Space

The action space is a 3-dimensional continuous space:

[left_motor, forward, right_motor], with each component between 0 and 1
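
Controllers such as evolved neural networks may emit values outside this range. The sketch below shows one way to bring raw outputs into bounds by clamping; `to_action` is a hypothetical helper, not part of the package, and clamping is only one possible choice (a sigmoid or rescaling would also work).

```python
from typing import List, Sequence


def to_action(outputs: Sequence[float]) -> List[float]:
    """Clamp three raw controller outputs into the [0, 1] action range.

    The order follows the action layout: [left_motor, forward, right_motor].
    """
    if len(outputs) != 3:
        raise ValueError(f"expected 3 outputs, got {len(outputs)}")
    return [min(1.0, max(0.0, float(v))) for v in outputs]


# Values below 0 or above 1 are clipped to the nearest bound.
print(to_action([-0.3, 0.5, 1.7]))  # [0.0, 0.5, 1.0]
```

If the environment expects a NumPy array, the result may need to be converted, for example with `np.asarray(action, dtype=env.action_space.dtype)`.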

Rewards

The default reward function prioritizes reaching the goal, with partial rewards for visiting points of interest along the way.

Similar Projects

Several other projects provide maze navigation environments for robotics and AI research:

Kheperax

Kheperax is a JAX-based reimplementation of the fastsim simulator that simulates Khepera-like robots in 2D mazes. Key differences from gymnasium-hardmaze:

Hardware acceleration: Fully implemented in JAX for GPU/TPU acceleration and massive parallelization
Robot model: Simulates circular robots with 2 wheels, configurable laser and bumper sensors
Optimization focus: Directly compatible with QDax library for Quality-Diversity optimization
Performance: Designed for high-throughput evolutionary algorithms and population-based methods

fastsim_gym

A Gym wrapper for the pyfastsim simulator, also implementing Lehman & Stanley's hard maze. Key differences:

Simulation backend: Uses the pyfastsim C++ simulator for physics simulation
Robot configuration: Features 3 lasers at specific angles (-π/4, 0, π/4) and two bumpers
Map format: Requires binary PBM format maps with specific size constraints
API compatibility: Uses the older Gym API rather than the newer Gymnasium standard

How gymnasium-hardmaze differs:

Modern API: Built specifically for the Gymnasium API with full compatibility
Pure Python: No external simulator dependencies, making installation and debugging easier
Research focus: Carefully ported from ES-HyperNEAT codebase for accurate replication

Citation

If you use this software in your research, please cite:

@software{gymnasium-hardmaze,
  author = {Stefano Palmieri},
  title = {HardMaze: A Gymnasium-compatible Implementation of hardmaze environment},
  url = {https://github.com/Teaspoon-AI/gymnasium-hardmaze},
  year = {2025},
}

Acknowledgements

The code in this project is derived from the original source code used in Sebastian Risi's 2011 ES-HyperNEAT paper.

@InProceedings{risi:gecco2011,
  author       = "Sebastian Risi and Kenneth O. Stanley",
  title        = "Enhancing ES-HyperNEAT to Evolve More Complex Regular Neural Networks",
  booktitle    = "Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2011)",
  year         = 2011,
  publisher    = "ACM",
  url          = "http://eplex.cs.ucf.edu/papers/risi_gecco11.pdf"
}
