RalphMeter

The physical unit for AI code synthesis.

Ralph — Tokens per Verified LOC. Lower is better.

Built for Ralph — the autonomous AI coding methodology.

RalphMeter Architecture

What This Measures

PRD (Intent) → [Agent + Energy] → Verified Application

RalphMeter captures the transformation from human intent to working code, measuring:

  • Verified LOC (vLOC): Lines of code that pass all applicable gates
  • Ralph: Tokens per Verified LOC (lower is better) — the cost of synthesis
  • vLOC/M: Verified LOC per Minute — effective productivity
  • Verification Rate: vLOC / LOC — how much survives the gates
  • PoE-LOC: Probability of Error per Line of Code

For true energy accounting, Ralph can be converted to Joules using provider-specific energy-per-token estimates.
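As a sketch, the relationships between these metrics look like the following (the names and types here are illustrative, not the RalphMeter API):

```typescript
// Illustrative metric relationships. `SessionStats` is a hypothetical
// shape for this sketch, not a RalphMeter type.
interface SessionStats {
  tokens: number;   // cumulative tokens consumed
  loc: number;      // total lines of code in the snapshot
  vloc: number;     // lines that passed all applicable gates
  minutes: number;  // wall-clock duration of the session
}

function ralph(s: SessionStats): number {
  return s.tokens / s.vloc;    // tokens per verified LOC; lower is better
}

function vlocPerMinute(s: SessionStats): number {
  return s.vloc / s.minutes;   // effective productivity
}

function verificationRate(s: SessionStats): number {
  return s.vloc / s.loc;       // fraction of code that survives the gates
}

const stats: SessionStats = { tokens: 4400, loc: 240, vloc: 180, minutes: 20 };
console.log(ralph(stats).toFixed(1));   // "24.4"
console.log(vlocPerMinute(stats));      // 9
console.log(verificationRate(stats));   // 0.75
```

Note that Ralph divides by verified LOC, while the verification rate divides verified LOC by total LOC: a session can show a low Ralph and still carry a lot of unverified code.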

Measurement Hierarchy

Session (full Ralph run)
├── Story 1 (may take N iterations)
│   ├── Iteration 1: +1,200 tokens
│   ├── Iteration 2: +1,400 tokens  
│   └── Iteration 3: PASSES
├── Story 2 (1 iteration)
│   └── Iteration 1: +800 tokens, PASSES
└── Story 3 (stuck)
    └── Iterations 1-5: +10,000 tokens, no pass

Cumulative Ralph — recalculated after each story:

After Story 1:  3,600 tokens,  100 LOC → Ralph = 36
After Story 2:  4,400 tokens,  180 LOC → Ralph = 24 ✓ improving
After Story 3: 14,400 tokens,  180 LOC → Ralph = 80 ⚠️ spike!
Signal               Meaning
Ralph trending down  Healthy convergence
Ralph spike          Problem story — tokens burned, little LOC added
Ralph flat           Steady progress

LOC is measured as a codebase snapshot. Ralph = cumulative tokens / current LOC. Spikes reveal problem stories; per-story deltas enable drill-down analysis.
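The cumulative recalculation above can be sketched as follows (the `StoryResult` shape and helper are assumptions for illustration, not the RalphMeter API):

```typescript
// Recompute cumulative Ralph after each story: tokens accumulate across
// the session, while LOC is a fresh codebase snapshot at each point.
interface StoryResult {
  tokens: number;       // tokens spent on this story (all iterations)
  locSnapshot: number;  // total LOC in the codebase after this story
}

function cumulativeRalph(stories: StoryResult[]): number[] {
  let tokens = 0;
  return stories.map((s) => {
    tokens += s.tokens;
    return Math.round(tokens / s.locSnapshot);
  });
}

console.log(cumulativeRalph([
  { tokens: 3600, locSnapshot: 100 },   // Story 1
  { tokens: 800, locSnapshot: 180 },    // Story 2: 4,400 tokens total
  { tokens: 10000, locSnapshot: 180 },  // Story 3 stuck: tokens burned, no new LOC
])); // [ 36, 24, 80 ]
```

The third value reproduces the spike in the example: cumulative tokens keep growing while the LOC snapshot stays flat.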

The 3-Gate Model

Each gate provides line-level verification through different mechanisms:

Gate  What It Checks  Line-Level How                              Applies When
G1    Compiles        Compiler errors point to specific lines     Always
G2    Correct         Test coverage maps passing tests to lines   Has tests
G3    Reachable       Runtime coverage shows executed lines       Has explorable surface

A line is verified when it passes all applicable gates:

Line 42: validateUser(input)
  G1 ✓ No compile error
  G2 ✓ Covered by passing test
  G3 ✓ Executed during exploration
  ✓ Verified

Line 87: unusedHelper()
  G1 ✓ Compiles fine
  G2 ✗ Not covered by any passing test
  G3 ✗ Never executed
  ✗ NOT verified (dead code)
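The pass/fail logic above can be sketched as follows (the `LineEvidence` type and the encoding of "gate not applicable" as `undefined` are illustrative assumptions):

```typescript
// Per-line gate evidence. An undefined field means that gate does not
// apply to this line (e.g. no tests yet, no explorable surface).
interface LineEvidence {
  compiles: boolean;      // G1: always applicable
  testCovered?: boolean;  // G2: applicable only when tests exist
  executed?: boolean;     // G3: applicable only when there is a surface to explore
}

// A line is verified when every *applicable* gate passes.
function isVerified(line: LineEvidence): boolean {
  if (!line.compiles) return false;
  if (line.testCovered === false) return false;
  if (line.executed === false) return false;
  return true;
}

// Line 42 from the example above:
console.log(isVerified({ compiles: true, testCovered: true, executed: true }));   // true
// Line 87, dead code:
console.log(isVerified({ compiles: true, testCovered: false, executed: false })); // false
```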

Configurable Thresholds

Story success can depend on gate thresholds:

{
  "gates": {
    "G1": { "required": true },
    "G2": { "required": true, "threshold": 0.80 },
    "G3": { "skip": true }
  }
}
  • required: Must pass for story to complete
  • threshold: Minimum % of lines that must pass this gate
  • skip: Gate is not evaluated (e.g., no tests yet, no explorable surface)

Code is verified when all applicable gates pass at the line level.
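One way such a config could be evaluated, as a sketch (the `gatePasses` helper and the default threshold of 1.0 are assumptions, not documented RalphMeter behavior):

```typescript
// Hypothetical gate-config evaluation: a skipped gate never blocks, an
// optional gate never blocks, and a required gate must meet its threshold.
interface GateConfig {
  required?: boolean;
  threshold?: number;  // minimum fraction of lines that must pass
  skip?: boolean;
}

function gatePasses(cfg: GateConfig, passFraction: number): boolean {
  if (cfg.skip) return true;              // gate is not evaluated
  if (!cfg.required) return true;         // optional gates never block a story
  const threshold = cfg.threshold ?? 1.0; // assumed default: every line must pass
  return passFraction >= threshold;
}

const gates = {
  G1: { required: true },
  G2: { required: true, threshold: 0.8 },
  G3: { skip: true },
};

// Fractions of lines passing each gate in a hypothetical story:
console.log(gatePasses(gates.G1, 1.0));  // true
console.log(gatePasses(gates.G2, 0.85)); // true (above the 0.80 threshold)
console.log(gatePasses(gates.G3, 0.0));  // true (skipped)
```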

Extensibility

The 3-gate model is the default, but gates are pluggable:

  • Add custom gates (security scans, performance budgets, accessibility)
  • Skip gates that don't apply to your project
  • Configure pass thresholds per gate

See Quality Gate Plugins for the extension API.

Session Metadata

Sessions support optional tags for arbitrary metadata:

{
  "sessionId": "abc-123",
  "tags": {
    "mode": "DEVELOP",
    "methodology": "ralph-wiggum",
    "human_intervention": "false"
  }
}

Use tags to:

  • Track different methodologies (agent modes, custom workflows)
  • A/B test agent configurations
  • Compare human-assisted vs fully autonomous sessions
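For example, tags make it easy to group sessions for comparison. A minimal sketch (the `Session` shape and its `ralph` field are assumptions for illustration, not the export format):

```typescript
// Group sessions by a tag value and compare mean Ralph per group.
interface Session {
  sessionId: string;
  tags: Record<string, string>;
  ralph: number; // cumulative Ralph at session end (assumed field)
}

function meanRalphByTag(sessions: Session[], key: string): Map<string, number> {
  const groups = new Map<string, number[]>();
  for (const s of sessions) {
    const value = s.tags[key] ?? "(untagged)";
    if (!groups.has(value)) groups.set(value, []);
    groups.get(value)!.push(s.ralph);
  }
  const means = new Map<string, number>();
  for (const [value, ralphs] of groups) {
    means.set(value, ralphs.reduce((a, b) => a + b, 0) / ralphs.length);
  }
  return means;
}

const sessions: Session[] = [
  { sessionId: "a", tags: { human_intervention: "false" }, ralph: 30 },
  { sessionId: "b", tags: { human_intervention: "false" }, ralph: 26 },
  { sessionId: "c", tags: { human_intervention: "true" }, ralph: 22 },
];
console.log(meanRalphByTag(sessions, "human_intervention"));
// Map(2) { 'false' => 28, 'true' => 22 }
```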

Quick Start

npm install
npm run dev

Best Practices

The 3:1 Refactoring Rule

For sustainable development, follow a 3:1 rhythm:

PRD 1: Feature A     → Ralph = 32, vRate = 78%
PRD 2: Feature B     → Ralph = 28, vRate = 75%
PRD 3: Feature C     → Ralph = 35, vRate = 68% ⚠️ cruft accumulating
PRD 4: Refactor      → Ralph = 42, vRate = 91% ✓ cleaned up

After every 3 feature PRDs, add a refactoring PRD.

Note: Ralph will likely increase after refactoring (you spent tokens without adding features). The win is verification rate — the refactoring PRD:

  • Removes dead code that G3 revealed as unreachable
  • Consolidates duplicate patterns the agent created
  • Adds tests for undertested areas (improves G2 coverage)
  • Pays down technical debt before it compounds

The goal isn't lowering Ralph in that moment — it's improving vLOC/LOC and preventing future Ralph explosions from working in a messy codebase.

Ralph Integration

RalphMeter provides a hook system that can be integrated into Ralph's loop to automatically meter agent activity.

TypeScript/JavaScript Integration

import { createRalphHooks } from 'ralphmeter';

// Create hooks instance
const hooks = createRalphHooks({ meterUrl: 'http://localhost:3333' });

// Start session with optional tags
await hooks.onSessionStart({
  tags: {
    mode: 'DEVELOP',
    methodology: 'ralph-wiggum',
    human_intervention: 'false'
  }
});

// Iteration loop
for (let i = 1; i <= maxIterations; i++) {
  await hooks.onIterationStart({
    iterationNumber: i,
    storyId: currentStory.id
  });

  // Track token usage
  await hooks.onTokensIn({ count: promptTokens, model: 'gpt-4' });
  await hooks.onTokensOut({ count: completionTokens, model: 'gpt-4' });

  // Track compilation
  const compileResult = await compile();
  await hooks.onCompilation({
    success: compileResult.success,
    errorCount: compileResult.errors.length,
    errors: compileResult.errors
  });

  // Track tests
  const testResult = await runTests();
  await hooks.onTestRun({
    success: testResult.success,
    totalTests: testResult.total,
    passed: testResult.passed,
    failed: testResult.failed,
    coveragePercent: testResult.coverage
  });

  // End iteration
  await hooks.onIterationEnd({
    iterationNumber: i,
    storyId: currentStory.id,
    success: testResult.success && compileResult.success
  });

  // Mark story complete if iteration succeeded
  if (testResult.success && compileResult.success) {
    await hooks.onStoryComplete({
      storyId: currentStory.id,
      passes: true,
      locCount: await countLOC()
    });
    break;
  }
}

// End session
await hooks.onSessionEnd({
  success: allStoriesComplete,
  reason: 'All stories completed'
});

Shell Script Integration

You can integrate RalphMeter into shell-based Ralph loops like ralph.sh:

#!/bin/bash
# ralph.sh with RalphMeter integration

METER_URL="http://localhost:3333"
SESSION_ID=$(uuidgen)

# Helper to send events to RalphMeter
# (the %3N millisecond format below requires GNU date)
send_event() {
  local event_type=$1
  local payload=$2
  
  curl -s -X POST "$METER_URL/api/sessions/$SESSION_ID/events" \
    -H "Content-Type: application/json" \
    -d "{
      \"event\": {
        \"timestamp\": \"$(date -u +%Y-%m-%dT%H:%M:%S.%3NZ)\",
        \"sessionId\": \"$SESSION_ID\",
        \"eventType\": \"$event_type\",
        \"payload\": $payload
      }
    }"
}

# Create session, registering our generated ID so subsequent events attach to it
curl -s -X POST "$METER_URL/api/sessions" \
  -H "Content-Type: application/json" \
  -d "{\"sessionId\": \"$SESSION_ID\", \"tags\": {\"mode\": \"DEVELOP\"}}"

# Start session
send_event "session_start" '{"tags": {"mode": "DEVELOP"}}'

# Iteration loop
ITERATION=1
STORY_ID="US-001"

while [ $ITERATION -le 10 ]; do
  echo "Iteration $ITERATION for $STORY_ID"
  
  # Start iteration
  send_event "iteration_start" "{\"iterationNumber\": $ITERATION, \"storyId\": \"$STORY_ID\"}"
  
  # Run your agent code here...
  # Track tokens, compilation, tests, etc.
  
  # Example: track compilation, counting errors instead of hardcoding 1
  TYPECHECK_OUTPUT=$(npm run typecheck 2>&1)
  if echo "$TYPECHECK_OUTPUT" | grep -q "error"; then
    ERROR_COUNT=$(echo "$TYPECHECK_OUTPUT" | grep -c "error")
    send_event "compilation_result" "{\"success\": false, \"errorCount\": $ERROR_COUNT}"
    SUCCESS=false
  else
    send_event "compilation_result" '{"success": true}'
    SUCCESS=true
  fi
  
  # End iteration
  send_event "iteration_end" "{\"iterationNumber\": $ITERATION, \"storyId\": \"$STORY_ID\", \"success\": $SUCCESS}"
  
  # Break if successful
  if [ "$SUCCESS" = "true" ]; then
    send_event "story_complete" "{\"storyId\": \"$STORY_ID\", \"passes\": true}"
    break
  fi
  
  ITERATION=$((ITERATION + 1))
done

# End session
send_event "session_end" '{"success": true, "reason": "Completed"}'

Configuration Options

const hooks = createRalphHooks({
  meterUrl: 'http://localhost:3333',
  sessionId: 'custom-session-id', // Optional: auto-generated if not provided
  verbose: true,                   // Optional: log hook calls to console
  timeout: 5000                    // Optional: request timeout in ms (default: 5000)
});

Viewing Metrics

After running a Ralph session with hooks enabled:

# View session summary
curl http://localhost:3333/api/sessions

# Get detailed metrics for a session
curl http://localhost:3333/api/sessions/{sessionId}/metrics

# Export session data in open format (for ecosystem tooling)
curl http://localhost:3333/api/sessions/{sessionId}/export

# Or use the CLI
ralphmeter report {sessionId}

Status

🚧 Under construction — being built by Ralph, for Ralph.

The Vision

An open standard for measuring AI coding agent efficiency, with:

  • Open spec for the metric format
  • Reference benchmarks for calibration
  • Commercial tooling for insights

"We're measuring exhaust, not combustion. The actual energy is reliable intent transformation under uncertainty."

About

The physical unit for AI code synthesis. Measures Ralph (Tokens per Verified LOC) for AI coding agents.
