Agent Sandbox

Run AI coding agents in a locked-down local sandbox with:

Minimal filesystem access (only your repo + project-scoped agent state)
Proxy-enforced domain allowlist (mitmproxy sidecar blocks non-allowed domains)
Iptables firewall preventing direct outbound (all traffic must go through the proxy)
Reproducible environments (Debian container with pinned dependencies)

Target platform: Colima + Docker Engine on Apple Silicon. Should work with any Docker-compatible runtime.

What it does

Creates a sandboxed environment for AI coding agents (currently Claude Code, GitHub Copilot CLI, but more to come) that:

Routes all HTTP/HTTPS traffic through an enforcing proxy sidecar
Blocks requests to domains not on the allowlist (403 with domain name in response)
Blocks all direct outbound via iptables (prevents bypassing the proxy)
Runs as non-root user with limited sudo only for firewall initialization in entrypoint
Persists agent credentials and configuration in a Docker volume across container rebuilds

Supported agents

Agent	Status
Claude Code	✅ Stable
GitHub Copilot CLI	🧪 Preview
OpenAI Codex CLI	🧪 Preview

Runtime modes

The sandbox is implemented as a Docker Compose project with a two-container stack: the agent container and a sidecar proxy (mitmproxy). There are two main ways to run this stack:

CLI mode (recommended) stores the docker-compose project in the .agent-sandbox directory.

Devcontainer mode stores the docker-compose project in the .devcontainer directory alongside the devcontainer.json configuration.

Both modes store the network proxy policy file in the .agent-sandbox directory.

Quick start (macOS + Colima)

1. Install prerequisites

You need docker and docker-compose installed. So far we've tested with Colima + Docker Engine, but this should work with Docker Desktop for Mac or Podman as well. Instructions that follow are for Colima.

# colima for VM, docker packages for running containers in VM, yq is a dependency for agentbox cli
brew install colima docker docker-compose docker-buildx yq
colima start --cpu 4 --memory 8 --disk 60

Set your Docker credential helper to osxkeychain (not desktop) in ~/.docker/config.json.

2. Install agent-sandbox CLI

The CLI is a helper script and thin wrapper around docker-compose that simplifies the process of initializing and starting the sandbox.

Local install (recommended)

# Clone the repo
git clone https://github.com/mattolson/agent-sandbox.git

# Add agenbox bin directory to your path (add this to your .bashrc or .zshrc)
export PATH="$PWD/agent-sandbox/cli/bin:$PATH"

yq is required to edit compose files. Install with brew install yq.

Run through docker image

You can also run the cli through a published docker image if you don't want to install anything locally:

# Pull the image to local docker
docker pull ghcr.io/mattolson/agent-sandbox-cli

# Add to your .bashrc or .zshrc
alias agentbox='docker run --rm -it -v "/var/run/docker.sock:/var/run/docker.sock" -v"$PWD:$PWD" -w"$PWD" -e TERM -e HOME --network none ghcr.io/mattolson/agent-sandbox-cli'

Using the Docker image disables the editor integration (vi installed in the image will be used instead of your host editor).

The host environment variables will not be available inside the container, unless you forward them explicitly. This is important, because Docker Compose runs inside the container. HOME is already forwarded to handle common use cases.

The image runs as root, to avoid permission issues with the host Docker socket. On Colima file ownership is mapped automatically, on Linux you should add --user parameter accordingly.

3. Initialize the sandbox for your project

agentbox init

This prompts you to select the agent type and mode (CLI or devcontainer), then sets up the necessary configuration files and network policy. You can also pass flags to skip prompts:

agentbox init --agent claude --mode cli

Optional volume mounts (dotfiles, shell customizations, .git read-only, etc.) are included as commented-out entries in the generated compose file. Uncomment them as needed, or set AGENTBOX_* environment variables for scripted usage. See the CLI README for the full list of flags and environment variables.

4. Start the sandbox

CLI:

# Open a shell in the agent container
agentbox exec

# Start your agent cli (e.g. claude). Because you're in a sandbox, you can even try yolo mode!
claude --dangerously-skip-permissions

Devcontainer (VS Code / JetBrains):

VS Code:

Install the Dev Containers extension
Command Palette > "Dev Containers: Reopen in Container"

JetBrains (IntelliJ, PyCharm, WebStorm, etc.):

Open your project
From the Remote Development menu, select "Dev Containers"
Select the devcontainer configuration

5. Agent-specific setup

Follow the setup instructions specific to the agent image you are using:

Network policy

Network enforcement has two layers:

Proxy (mitmproxy sidecar) - Enforces a domain allowlist at the HTTP/HTTPS level. Blocks requests to non-allowed domains with 403.
Firewall (iptables) - Blocks all direct outbound from the agent container. Only the Docker host network is reachable, which is where the proxy sidecar runs. This prevents applications from bypassing the proxy.

The proxy image ships with a default policy that blocks all traffic. You must mount a policy file to allow any outbound requests. agentbox init will set this up for you.

How it works

The agent container has HTTP_PROXY/HTTPS_PROXY set to point at the proxy sidecar. The proxy runs a mitmproxy addon (enforcer.py) that checks every HTTP request and HTTPS CONNECT tunnel against the domain allowlist. Non-matching requests get a 403 response.

The agent's iptables firewall (init-firewall.sh) blocks all direct outbound except to the Docker bridge network. This means even if an application ignores the proxy env vars, it cannot reach the internet directly.

The proxy's CA certificate is shared via a Docker volume and automatically installed into the agent's system trust store at startup.

Customizing the policy

The network policy lives in your project in the .agent-sandbox directory. This file can be checked into version control and shared with your team.

To edit the policy file:

agentbox edit policy

This opens the network policy file in your editor. If you save changes, the proxy service will automatically restart to apply the new policy.

Example policy:

services:
  - claude

domains:
  # Add your own
  - registry.npmjs.org
  - pypi.org

The .agent-sandbox directory is mounted read-only inside the agent container, preventing the agent from modifying the policy. The proxy reads the policy at startup, so changes require a restart from the host.

See docs/policy/schema.md for the full policy format reference.

Customization

Git inside the container - Credential setup and SSH-to-HTTPS rewriting
Dotfiles and shell customization - Mount dotfiles and shell.d scripts
Language stacks - Extend the base image with Python, Node, Go, Rust
Image versioning - Pin and bump image digests

Security

This project reduces risk but does not eliminate it. Local dev is inherently best-effort sandboxing.

Key principles:

Minimal mounts: only the repo workspace + project-scoped agent state
Network egress is tightly controlled through sidecar proxy with default deny policy
Firewall verification runs at every container start

Git credentials

If you store git credentials inside the container (via git credential-store or any other method), the token grants access to whatever repositories it was scoped to. A classic personal access token or OAuth token grants access to all repositories your GitHub account can access, not just the current project. The network allowlist limits where data can be sent, but an agent with a broad token could read or modify any of your repos on github.com.

To limit exposure:

Run git from the host - No credentials in the container at all
Use a fine-grained PAT - Scope the token to specific repositories
Use a separate GitHub account - Isolate sandboxed work entirely

IDE devcontainer

Operating as a devcontainer (VS Code or JetBrains) opens a channel to the IDE. Installing IDE extensions can introduce risk. For tighter security, we recommend running in CLI mode (local docker compose and terminal operations, rather than IDE extensions).

Security issues

If you find a sandbox escape or bypass:

Open a GitHub Security Advisory (preferred), or
Open an issue with minimal reproduction details

Roadmap

See docs/roadmap.md for planned features and milestones.

Contributing

PRs welcome for:

New agent support
Improved network policies
Documentation and examples

Please keep changes agent-agnostic where possible and compatible with Colima on macOS.

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 260 Commits
.agent-sandbox		.agent-sandbox
.agents/skills		.agents/skills
.claude		.claude
.github/workflows		.github/workflows
cli		cli
docs		docs
images		images
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitignore		.gitignore
.gitmodules		.gitmodules
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Sandbox

What it does

Supported agents

Runtime modes

Quick start (macOS + Colima)

1. Install prerequisites

2. Install agent-sandbox CLI

Local install (recommended)

Run through docker image

3. Initialize the sandbox for your project

4. Start the sandbox

5. Agent-specific setup

Network policy

How it works

Customizing the policy

Customization

Security

Git credentials

IDE devcontainer

Security issues

Roadmap

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

License

mattolson/agent-sandbox

Folders and files

Latest commit

History

Repository files navigation

Agent Sandbox

What it does

Supported agents

Runtime modes

Quick start (macOS + Colima)

1. Install prerequisites

2. Install agent-sandbox CLI

Local install (recommended)

Run through docker image

3. Initialize the sandbox for your project

4. Start the sandbox

5. Agent-specific setup

Network policy

How it works

Customizing the policy

Customization

Security

Git credentials

IDE devcontainer

Security issues

Roadmap

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

Packages