agent

This repo is a production-oriented agentic platform: a portable core plus a daemon, broker, CLI, and WebUI for durable, audited automation. It is not a prototype — the repo includes deployment checklists, verification scripts, and evidence bundles for repeatable, production-grade runs.

Status

Rolling release: expect breaking changes until the first stability milestone.
Production-oriented verification and deployment tooling (see docs/OPERATIONS.md and docs/DEPLOYMENT.md).
Evidence bundles + repo guards for repeatable runs (tools/capture_agent_evidence_bundle.sh, tools/verify_repo_guards.sh).

Components

agent_core (portable C library): session model, compaction helpers, role utilities.
agentd (daemon): HTTP/SSE APIs, persistence, tool runtime, workflows, approvals, diagnostics.
broker (optional relay): auth + multi-deployment routing, audit/event replay, team runs, OTA fanout.
WebUI (advanced layout): Scene + Conversation, queued prompts, Memory/Trace/Workflows panels, broker console + teams.
CLI (agent): local sessions or remote daemon targeting.

Key capabilities:

OpenAI-compatible providers (OpenAI/OpenRouter/DeepSeek/Moonshot, etc.) with runtime diagnostics and smoke tests.
Durable workflows (DAGs + idempotency) with restart-safe requeueing and evidence tooling.
Team orchestration with role plans, approvals, runtime members, and replayable events.
Memory tooling (recaps, correlation index, trace linking) and operator-grade panels.
Policy/limits hooks for tool budgets, approvals, and automation profiles.
OTA updates (agentd + broker + WebUI) with drain + continuity checks.
Shared codexw broker bridge: the native connector advertises runtime actions plus session/event activity surfaces so codexw's broker, iOS app, and WebUI can show agentd daemon sessions, workflows, client events, and workflow events through /api/v2/runtime-instances/.... Prompt attachments from codexw are uploaded into agentd sessions and submitted as workflow input_files.

NanoClaw leverage (in progress)

We reviewed NanoClaw (cloned under refs/nanoclaw) and captured a concrete leverage plan in docs/spec/nanoclaw_leverage_v0.md. Near-term, low-risk targets include:

Channel/connector self-registration (registry boundary).
Per-group workspace + memory scope isolation.
Optional containerized tool runner with mount allowlist enforcement.
First-class cron schedules for workflows.
Skills-style guided transforms for common upgrades.
Team run multi-result streaming semantics and session anchoring.

These items are targets, not assumptions about current behavior.

Docs quickstart:

Unified handbook (curated summary; generated): docs/HANDBOOK.md
Architecture + roadmap: DESIGN.md (full system map)
Build/verify/test/cleanup/publish workflows: docs/OPERATIONS.md
Deployment checklist (agentd + broker + WebUI): docs/DEPLOYMENT.md
Daemon ops + runtime config: docs/AGENTD.md
CLI usage + tool-loop guidance: docs/CLI.md
WebUI dev/build/runtime config: docs/WEBUI.md
Full docs index: docs/README.md
Evidence bundle capture: tools/capture_agent_evidence_bundle.sh

Handbook guard:

tools/verify_repo_guards.sh enforces a handbook line limit via HANDBOOK_MAX_LINES (default: 250).

Demo: ESP32 voice agentic

esp32_voice_agent.mp4

Build

cmake -S . -B build
cmake --build build -j
ctest --test-dir build --output-on-failure

One-command verify (configure + build + tests; logs under build/):

tools/verify.sh

Include repo hygiene guards:

tools/verify.sh --repo-guards

For a CI-equivalent workspace-size check, run tools/verify_repo_guards.sh --strict.

For the Docker broker lane, run tools/verify_compose_broker_smokes.sh.

Source ${HOME}/.env before verification (provider keys for smokes):

tools/verify_prod.sh

Platform dependencies, Windows helpers, core-only build, and smoke-test notes live in docs/OPERATIONS.md.

Tool extensions

Tool plugins and tool servers are documented in docs/TOOLS.md.

Testing

For network smokes, the real browser E2E harness, and verification workflows, see docs/OPERATIONS.md. Daemon runtime details (config snapshot, defaults update, state safety, and HTTP APIs) live in docs/AGENTD.md.

Run in production (agentd + WebUI)

Production shape:

agentd runs as the backend daemon (tools + persistence + HTTP API).
The WebUI runs as a separate static site (Vite build) that talks to agentd or the broker.

For a full production checklist (TLS, broker/connector, auth hardening, backups), see docs/DEPLOYMENT.md. For WebUI dev/build/runtime config, see docs/WEBUI.md.

codexw shared broker bridge

tools/agentd_codexw_native_broker_connector.py registers agentd as a runtime_kind=agentd instance in the sibling codexw broker. The connector advertises broker.runtime_capabilities.v1 with neutral activity and control surfaces, then serves these runtime-local adapter commands:

GET /api/v1/runtime/sessions: projects daemon sessions and workflows into neutral runtime-instance session rows.
POST /api/v1/runtime/sessions: creates an agentd daemon session through /api/v1/session/new and returns a neutral runtime-instance session row for codexw /api/v2/runtime-instances/{id}/sessions.
GET /api/v1/runtime/events: projects client events and workflow events into neutral broker.runtime_event.v1 rows, with limit, after_id, last_event_id, session_id, and event_prefix filtering.
POST /api/v1/session/{session_id}/files/list and POST /api/v1/session/{session_id}/files/read and POST /api/v1/session/{session_id}/files/write: expose a bounded file-root adapter for the codexw iOS file explorer, including remote document uploads into the selected working-tree directory.
GET /api/v1/session/{session_id}/shells, POST /api/v1/session/{session_id}/shells/start, and GET /api/v1/session/{session_id}/shells/{shell_id}: expose one-shot shell execution so codexw iOS can verify and inspect agentd hosts through the existing host-shell UI. Interactive send is intentionally rejected until agentd owns a durable interactive terminal contract.
GET /api/v1/runtime/events with Accept: text/event-stream is implemented by the external local-API facade for the codexw /api/v2/runtime-instances/{id}/proxy/sse route. The native deployment websocket connector does not advertise surfaces.proxy_sse because codexw's SSE proxy requires a broker-known local API URL rather than a command-result websocket frame.
GET /api/v1/runtime/status: reports read-only update/restart readiness. The default bridge reports update disabled; the native connector reports agentd OTA status when AGENTD_CODEXW_RUNTIME_UPDATE_MODE=agentd_ota. If /api/v1/ota/status includes a candidate, update_candidate, release, or artifact object with url, sha256, version, or drain_timeout_ms, the bridge forwards it as update.candidate so the shared broker can show and prepare the exact runtime.update input. When AGENTD_CODEXW_RUNTIME_RESTART_MODE=agentd_ota, the same route exposes daemon-proven restart readiness from /api/v1/ota/status; restart is enabled only when agentd reports the supervisor boundary agentd_supervisor_restart_drain.
POST /api/v1/runtime/actions now also advertises and forwards voice.webrtc_peer.status, voice.webrtc_peer.start, and voice.webrtc_peer.stop to the existing agentd /api/v1/session/voice_webrtc_peer managed WebRTC runtime. This exposes the voice-peer runtime through the same codexw broker action plane as workflows, schedules, experience records, and operator actions without changing the separate media.direct_p2p=false boundary.
surfaces.proxy_http lets codexw's bounded HTTP proxy call the same /api/v1/runtime/... JSON routes. It does not expose arbitrary agentd paths.

The broker only uses these routes after capability negotiation. Older bridges that do not advertise the surfaces continue to appear in the shared inventory with empty common activity instead of fabricated codexw-specific state.

To prove the deployed codexw broker path end to end from this repo, run:

tools/verify_codexw_live_agentd_activity.sh

The smoke uses the sibling codexw repo's scripts/broker-admin session, issues a one-time deployment enrollment token, approves the temporary proof deployment after certificate enrollment, verifies /api/v2/runtime-instances, /sessions, bounded /proxy/http, /events, and the deployment-scoped file/shell routes used by the codexw iOS Files and Host Shell views against the live broker. It writes proof JSON under build/ and removes the temporary broker deployment unless KEEP_DEPLOYMENT=1 is set. The live broker must be running a codexw build with the v2 runtime-instance session and proxy routes; a 405 method_not_allowed response from POST /api/v2/runtime-instances/{id}/sessions means the broker deployment is stale relative to the connector contract, not that the local agentd connector failed.

To prove the opt-in OTA update boundary without replacing a real daemon, run:

tools/verify_codexw_live_agentd_ota_candidate.sh

That proof connects the native connector to a temporary loopback fake agentd/OTA API, enrolls and approves a short-lived broker deployment, verifies the deployed broker exposes runtime_kind=agentd, update.candidate, and an enabled runtime.update descriptor, then calls broker /actions/preflight. The proof asserts the returned prepared_input contains {url, sha256, version, reason, drain_timeout_ms} from current status instead of a client-built artifact payload, and it verifies the fake OTA update endpoint was not called. Set VERIFY_AGENTD_OTA_DISPATCH=1 only when you intentionally want the proof to dispatch to the fake endpoint as well. No daemon binary is replaced in either mode.

To prove that the deployed codexw broker preserves durable connector readiness status and readiness-transition audit from /api/v1/runtime/status, run:

tools/verify_codexw_live_agentd_connector_readiness.sh

That proof connects the native connector to a temporary loopback fake agentd API with a seeded self-test status JSON file, verifies the live broker exposes the runtime as runtime_kind=agentd, reads /api/v2/runtime-instances/{id}/status, and asserts the top-level connector object contains state, policy_state, last_ok, checked_unix_ms, age_ms, stale_after_ms, failed check count, and failed check names. It then rewrites the durable self-test status from failed to fresh, verifies one-shot connector_alert responses for the first failed observation and the failed-to-fresh recovery, verifies duplicate status reads suppress duplicate alerts, and checks /api/v2/runtime-instances/{id}/audit for durable runtime_instance.connector_readiness_transition events. It removes the temporary deployment unless KEEP_DEPLOYMENT=1 is set.

To prove the opt-in supervisor restart boundary through the deployed codexw broker without restarting a real daemon, run:

tools/verify_codexw_live_agentd_restart.sh

That proof connects the native connector to a temporary loopback fake agentd API that reports restart.enabled=true with safe boundary agentd_supervisor_restart_drain, verifies the live broker exposes an enabled runtime.restart descriptor, calls non-mutating /actions/preflight, dispatches one confirmed restart to the fake /api/v1/ota/restart endpoint, verifies the forwarded {reason, idempotency_key, drain_timeout_ms} body, and verifies a duplicate broker action replays from idempotency instead of forwarding a second restart.

For durable service installs, use the launchd/systemd connector templates in docs/DEPLOYMENT.md. They run agentd and the native codexw connector as separate OS-managed services and keep broker passwords, enrollment secrets, and local AGENTD_AUTH_TOKEN out of process arguments. The connector also supports a read-only --self-test mode for service readiness checks; with --require-broker-visible and broker read credentials, it proves the durable service is enrolled, connected, and visible through shared broker runtime-instance inventory. When AGENTD_CODEXW_RUNTIME_UPDATE_MODE=agentd_ota is enabled, set AGENTD_CODEXW_REQUIRE_UPDATE_PREFLIGHT=1 to make that same self-test call the broker /actions/preflight route and verify the exact status-derived runtime.update payload without dispatching POST /api/v1/ota/update. The macOS launchd installer now installs a companion com.agentd.codexw-connector.self-test LaunchAgent by default, matching the systemd timer behavior with the same broker-visible and optional update preflight checks. It also passes a stable --runtime-instance-id, defaulting to <deployment_id>-runtime unless AGENTD_CODEXW_RUNTIME_INSTANCE_ID is set. That keeps codexw's runtime-instance inventory and native iOS deployment row stable across launchd restarts instead of rotating through random agentd-* ids. Use tools/verify_agentd_codexw_connector_service.py on an installed host to inspect launchd/systemd state, recent self-test logs, redacted service configuration, broker visibility, and optional update preflight in one report. The periodic self-test also writes a durable JSON status file at AGENTD_CODEXW_SELF_TEST_OUTPUT_PATH, giving operators and future client surfaces a stable machine-readable last-readiness result. When the native connector serves GET /api/v1/runtime/status, it reads that file back into the top-level connector status object so the shared codexw broker, WebUI, and native clients can show last self-test freshness and failed checks without host filesystem access. The connector normalizes that local file into an explicit readiness policy: fresh means the last self-test passed and is newer than the stale threshold, stale means it passed but is too old, failed means the last self-test failed or the status file is invalid, and missing means the durable status file is not configured or has not been written yet. The stale threshold defaults to AGENTD_CODEXW_SELF_TEST_STALE_AFTER_SECONDS=900 so the state turns stale after three missed five-minute timer intervals unless the deployment chooses a different policy.

Broker operator actions are opt-in. By default the connector still does not advertise runtime.restart, runtime.update, or runtime.upgrade. Set AGENTD_CODEXW_RUNTIME_UPDATE_MODE=agentd_ota only on hosts where agentd OTA is enabled and configured with a daemon-owned drain/update policy; then the connector advertises runtime.update and forwards broker requests to POST /api/v1/ota/update. Set AGENTD_CODEXW_RUNTIME_RESTART_MODE=agentd_ota only when /api/v1/ota/status reports restart.enabled=true for a systemd/launchd service with safe boundary agentd_supervisor_restart_drain; then the connector advertises runtime.restart and forwards broker requests to POST /api/v1/ota/restart. Compatibility runtime.upgrade remains unadvertised.

Quick start (loopback only; no auth required):

cd /path/to/agent
./build/agentd \
  --host 127.0.0.1 \
  --port 8123 \
  --tools host \
  --yolo \
  --host-scope "$(pwd)" \
  --tools-root "@host" \
  --db-path "$(pwd)/agentd.db"

If you bind to non-loopback (--host 0.0.0.0), agentd requires --auth-token and explicit CORS allowlists. See docs/DEPLOYMENT.md for hardened examples and service configs.

Vendored subtrees live under ref/ and are treated read-only in this repo. If changes are needed, update upstream or copy into first-party code. See docs/VENDORED.md.

Cleanup (disk usage)

Cleanup commands, repo size reports, and guard scripts live in docs/OPERATIONS.md.

Docker Compose (prod-like local verification)

For a prod-like local stack (Postgres + Keycloak + broker + connector + agentd + WebUI):

./tools/verify_compose_stack.sh

Alternatives:

No Docker: tools/verify_mac_local_stack.sh (agentd + WebUI only).
Docker runs but builds blocked: tools/verify_mac_full_stack_host.sh.
No Docker/Keycloak, local Postgres: tools/verify_mac_full_stack_local_postgres.sh.
One-command devstack: tools/devstack_agent.sh (stop with tools/devstack_agent_down.sh, add --wipe-volumes to reset Keycloak).

Tip: local verify supports provider tests via MAC_LOCAL_PROVIDER_TEST=1 (uses AGENTD_TOOLS=basic by default; override with AGENTD_TOOLS=basic|host|none).

Full details, env flags, port notes, and Keycloak guidance live in docs/DEPLOYMENT.md.

Git remote (publishing)

Remote setup and publish helpers live in docs/OPERATIONS.md.

CLI usage

CLI examples and tool-loop guidance live in docs/CLI.md.

Core library

Header: core/include/agent/agent.h
Scope: session model + char-budget compaction + role helpers
No environment variable access in the core (host-only concern).

Daemon + Web UI (day-1)

Local daemon usage and HTTP/SSE API notes live in docs/AGENTD.md. WebUI dev/build/runtime config lives in docs/WEBUI.md.

Name		Name	Last commit message	Last commit date
Latest commit History 1,915 Commits
.github		.github
broker		broker
cli/src		cli/src
core		core
daemon		daemon
docker		docker
docs		docs
esp32_demo		esp32_demo
examples/esp32s3_agent_core		examples/esp32s3_agent_core
packaging/systemd		packaging/systemd
ref		ref
tests		tests
tools		tools
ui		ui
.dockerignore		.dockerignore
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
DESIGN.md		DESIGN.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
TODOS.md		TODOS.md
agents.md		agents.md
docker-compose.yml		docker-compose.yml
project.local.md.example		project.local.md.example
project.md		project.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

agent

Status

Components

NanoClaw leverage (in progress)

Demo: ESP32 voice agentic

Build

Tool extensions

Testing

Run in production (agentd + WebUI)

codexw shared broker bridge

Cleanup (disk usage)

Docker Compose (prod-like local verification)

Git remote (publishing)

CLI usage

Core library

Daemon + Web UI (day-1)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

agent

Status

Components

NanoClaw leverage (in progress)

Demo: ESP32 voice agentic

Build

Tool extensions

Testing

Run in production (agentd + WebUI)

codexw shared broker bridge

Cleanup (disk usage)

Docker Compose (prod-like local verification)

Git remote (publishing)

CLI usage

Core library

Daemon + Web UI (day-1)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages