CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

Auto Claude is a multi-agent autonomous coding framework that builds software through coordinated AI agent sessions. It uses the Claude Code SDK to run agents in isolated workspaces with security controls.

Commands

Setup

Requirements:

Python 3.12+ (required for backend)
Node.js (for frontend)

# Install all dependencies from root
npm run install:all

# Or install separately:
# Backend (from apps/backend/)
cd apps/backend && uv venv && uv pip install -r requirements.txt

# Frontend (from apps/frontend/)
cd apps/frontend && npm install

# Set up OAuth token
claude setup-token
# Add to apps/backend/.env: CLAUDE_CODE_OAUTH_TOKEN=your-token

Creating and Running Specs

cd apps/backend

# Create a spec interactively
python spec_runner.py --interactive

# Create spec from task description
python spec_runner.py --task "Add user authentication"

# Force complexity level (simple/standard/complex)
python spec_runner.py --task "Fix button" --complexity simple

# Run autonomous build
python run.py --spec 001

# List all specs
python run.py --list

Workspace Management

cd apps/backend

# Review changes in isolated worktree
python run.py --spec 001 --review

# Merge completed build into project
python run.py --spec 001 --merge

# Discard build
python run.py --spec 001 --discard

QA Validation

cd apps/backend

# Run QA manually
python run.py --spec 001 --qa

# Check QA status
python run.py --spec 001 --qa-status

Testing

# Install test dependencies (required first time)
cd apps/backend && uv pip install -r ../../tests/requirements-test.txt

# Run all tests (use virtual environment pytest)
apps/backend/.venv/bin/pytest tests/ -v

# Run single test file
apps/backend/.venv/bin/pytest tests/test_security.py -v

# Run specific test
apps/backend/.venv/bin/pytest tests/test_security.py::test_bash_command_validation -v

# Skip slow tests
apps/backend/.venv/bin/pytest tests/ -m "not slow"

# Or from root
npm run test:backend

Spec Validation

python apps/backend/validate_spec.py --spec-dir apps/backend/specs/001-feature --checkpoint all

Releases

# 1. Bump version on your branch (creates commit, no tag)
node scripts/bump-version.js patch   # 2.8.0 -> 2.8.1
node scripts/bump-version.js minor   # 2.8.0 -> 2.9.0
node scripts/bump-version.js major   # 2.8.0 -> 3.0.0

# 2. Push and create PR to main
git push origin your-branch
gh pr create --base main

# 3. Merge PR → GitHub Actions automatically:
#    - Creates tag
#    - Builds all platforms
#    - Creates release with changelog
#    - Updates README

See RELEASE.md for detailed release process documentation.

Architecture

Core Pipeline

Spec Creation (spec_runner.py) - Dynamic 3-8 phase pipeline based on task complexity:

SIMPLE (3 phases): Discovery → Quick Spec → Validate
STANDARD (6-7 phases): Discovery → Requirements → [Research] → Context → Spec → Plan → Validate
COMPLEX (8 phases): Full pipeline with Research and Self-Critique phases

Implementation (run.py → agent.py) - Multi-session build:

Planner Agent creates subtask-based implementation plan
Coder Agent implements subtasks (can spawn subagents for parallel work)
QA Reviewer validates acceptance criteria
QA Fixer resolves issues in a loop

Key Components (apps/backend/)

client.py - Claude SDK client with security hooks and tool permissions
security.py + project_analyzer.py - Dynamic command allowlisting based on detected project stack
worktree.py - Git worktree isolation for safe feature development
memory.py - File-based session memory (primary, always-available storage)
graphiti_memory.py - Graph-based cross-session memory with semantic search
graphiti_providers.py - Multi-provider factory for Graphiti (OpenAI, Anthropic, Azure, Ollama, Google AI)
graphiti_config.py - Configuration and validation for Graphiti integration
linear_updater.py - Optional Linear integration for progress tracking

Agent Prompts (apps/backend/prompts/)

Prompt	Purpose
planner.md	Creates implementation plan with subtasks
coder.md	Implements individual subtasks
coder_recovery.md	Recovers from stuck/failed subtasks
qa_reviewer.md	Validates acceptance criteria
qa_fixer.md	Fixes QA-reported issues
spec_gatherer.md	Collects user requirements
spec_researcher.md	Validates external integrations
spec_writer.md	Creates spec.md document
spec_critic.md	Self-critique using ultrathink
complexity_assessor.md	AI-based complexity assessment

Spec Directory Structure

Each spec in .auto-claude/specs/XXX-name/ contains:

spec.md - Feature specification
requirements.json - Structured user requirements
context.json - Discovered codebase context
implementation_plan.json - Subtask-based plan with status tracking
qa_report.md - QA validation results
QA_FIX_REQUEST.md - Issues to fix (when rejected)

Branching & Worktree Strategy

Auto Claude uses git worktrees for isolated builds. All branches stay LOCAL until user explicitly pushes:

main (user's branch)
└── auto-claude/{spec-name}  ← spec branch (isolated worktree)

Key principles:

ONE branch per spec (auto-claude/{spec-name})
Parallel work uses subagents (agent decides when to spawn)
NO automatic pushes to GitHub - user controls when to push
User reviews in spec worktree (.worktrees/{spec-name}/)
Final merge: spec branch → main (after user approval)

Workflow:

Build runs in isolated worktree on spec branch
Agent implements subtasks (can spawn subagents for parallel work)
User tests feature in .worktrees/{spec-name}/
User runs --merge to add to their project
User pushes to remote when ready

Security Model

Three-layer defense:

OS Sandbox - Bash command isolation
Filesystem Permissions - Operations restricted to project directory
Command Allowlist - Dynamic allowlist from project analysis (security.py + project_analyzer.py)

Security profile cached in .auto-claude-security.json.

Memory System

Dual-layer memory architecture:

File-Based Memory (Primary) - memory.py

Zero dependencies, always available
Human-readable files in specs/XXX/memory/
Session insights, patterns, gotchas, codebase map

Graphiti Memory - graphiti_memory.py

Graph database with semantic search (LadybugDB - embedded, no Docker)
Cross-session context retrieval
Multi-provider support:
- LLM: OpenAI, Anthropic, Azure OpenAI, Ollama, Google AI (Gemini)
- Embedders: OpenAI, Voyage AI, Azure OpenAI, Ollama, Google AI
Configure with provider credentials in .env.example

Project Structure

auto-claude/
├── apps/
│   ├── backend/           # Python backend/CLI (the framework code)
│   └── frontend/          # Electron desktop UI
├── guides/                # Documentation
├── tests/                 # Test suite
└── scripts/               # Build and utility scripts

As a standalone CLI tool:

cd apps/backend
python run.py --spec 001

With the Electron frontend:

npm start        # Build and run desktop app
npm run dev      # Run in development mode

.auto-claude/specs/ - Per-project data (specs, plans, QA reports) - gitignored

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Overview

Commands

Setup

Creating and Running Specs

Workspace Management

QA Validation

Testing

Spec Validation

Releases

Architecture

Core Pipeline

Key Components (apps/backend/)

Agent Prompts (apps/backend/prompts/)

Spec Directory Structure

Branching & Worktree Strategy

Security Model

Memory System

Project Structure

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Overview

Commands

Setup

Creating and Running Specs

Workspace Management

QA Validation

Testing

Spec Validation

Releases

Architecture

Core Pipeline

Key Components (apps/backend/)

Agent Prompts (apps/backend/prompts/)

Spec Directory Structure

Branching & Worktree Strategy

Security Model

Memory System

Project Structure