HTTP API Reference

All services communicate over HTTP on localhost. This document covers every endpoint across all SecAI OS services.

Registry API (port 8470)

GET /v1/models

List all registered model artifacts.

Response: 200 OK -- JSON array of artifact objects
Fields per artifact: name, path, sha256, format, source, status, promoted_at, gguf_guard_fingerprint, gguf_guard_manifest

GET /v1/model/{name}

Retrieve a single artifact by name.

Response: 200 OK -- JSON artifact object
Error: 404 Not Found -- model not in registry

POST /v1/promote

Promote a quarantined model to the registry.

Request body:

{
  "name": "model-name",
  "path": "/var/lib/secure-ai/quarantine/model-file",
  "sha256": "hash",
  "format": "gguf",
  "source": "huggingface"
}

Response: 200 OK -- promotion successful
Error: 400 Bad Request -- validation failed

DELETE /v1/model/{name}

Remove a model from the registry.

Response: 200 OK -- model removed
Error: 404 Not Found -- model not in registry

POST /v1/model/verify-manifest

Verify SHA-256 integrity of all registered models.

Response: 200 OK -- JSON object with per-model verification results

Tool Firewall API (port 8475)

POST /v1/tool/invoke

Invoke a tool through the policy firewall.

Request body:

{
  "tool": "tool_name",
  "arguments": { "key": "value" }
}

Response: 200 OK -- tool invocation allowed and result returned
```
{
  "status": "allowed",
  "result": { ... }
}
```

Error: 403 Forbidden -- tool denied by policy

{
  "status": "denied",
  "reason": "description of why the tool was denied"
}

Error: 400 Bad Request -- invalid arguments (blocked pattern, length exceeded)
Error: 429 Too Many Requests -- rate limit exceeded

Airlock API (port 8490)

POST /v1/proxy

Proxy an outbound request through the Airlock.

Request body:

{
  "url": "https://example.com/path",
  "method": "GET",
  "headers": {},
  "body": null
}

Response: 200 OK -- proxied response returned
Error: 403 Forbidden -- destination not allowlisted, PII detected, or credentials detected
```
{
  "error": "description of block reason"
}
```
Error: 429 Too Many Requests -- rate limit exceeded
Error: 503 Service Unavailable -- Airlock is disabled

Agent API (port 8476)

POST /v1/task

Submit a new task for the agent to plan and execute.

Request body:

{
  "intent": "summarize the documents in my workspace",
  "mode": "standard",
  "workspace": ["user_docs"],
  "preferences": { "read_file": "always" }
}

Fields:
- workspace: array of workspace IDs (not raw paths). Available IDs: user_docs, outputs. Resolved to filesystem paths server-side.
Response: 201 Created -- task with planned steps
Error: 400 Bad Request -- missing intent, invalid mode, or unknown workspace ID

GET /v1/task/{id}

Get task status and step details.

Response: 200 OK -- task object with steps
Error: 404 Not Found -- task not found

POST /v1/task/{id}/approve

Approve pending steps that require user confirmation.

Request body:

{
  "step_ids": ["abc123"],
  "approve_all": false
}

Response: 200 OK -- updated task

POST /v1/task/{id}/deny

Deny pending steps.

Request body:

{
  "step_ids": ["abc123"],
  "deny_all": false
}

Response: 200 OK -- updated task

POST /v1/task/{id}/cancel

Cancel a running or pending task.

Response: 200 OK -- task cancelled
Error: 409 Conflict -- task already completed/failed/cancelled

GET /v1/tasks

List all tasks (most recent first).

Query params: limit (default 50, max 200)
Response: 200 OK -- array of task objects

GET /v1/modes

List available operating modes with descriptions.

Response: 200 OK -- array of mode objects (offline_only, standard, online_assisted, sensitive)

UI API (port 8480)

Model Management

GET /api/models

List all models (combines registry and quarantine data).

Response: 200 OK -- JSON array of model objects with status information

POST /api/models/download

Initiate a model download through the Airlock.

Request body:

{
  "url": "https://huggingface.co/...",
  "name": "model-name"
}

Response: 200 OK -- download initiated
Error: 400 Bad Request -- invalid URL or name
Error: 503 Service Unavailable -- Airlock disabled

POST /api/models/import

Import a local model file into quarantine.

Request body: Multipart form data with model file
Response: 200 OK -- model submitted to quarantine
Error: 400 Bad Request -- invalid file or format

POST /api/models/verify-manifest

Trigger integrity verification of all registered models.

Response: 200 OK -- verification results

Chat and Generation

POST /api/chat

Send a chat message and receive a response from the active model.

Request body:

{
  "message": "user message text",
  "model": "model-name",
  "conversation_id": "optional-id"
}

Response: 200 OK -- streaming or complete response from the model

POST /api/generate

Generate text from a prompt (non-chat completion).

Request body:

{
  "prompt": "prompt text",
  "model": "model-name",
  "max_tokens": 512
}

Response: 200 OK -- generated text

Diffusion Runtime (On-Demand Acquisition)

The diffusion runtime (PyTorch, diffusers, etc.) is not included in the base OS image. These endpoints manage the one-click install flow.

Contract:

GET /api/diffusion/runtime/status is the source of truth for whether the runtime is installed, failed, or available for install. Always safe to call.
POST /api/diffusion/runtime/enable requests installation by writing a marker file. A systemd path unit triggers the privileged installer.
GET /api/diffusion/runtime/progress is only meaningful after enable has been called. Callers should check status first.

GET /api/diffusion/runtime/status

Return the current diffusion runtime state.

Response: 200 OK

{
  "installed": false,
  "detected_backend": "cuda",
  "estimated_size_mb": 4500,
  "cache_available": false,
  "installing": false,
  "manifest_populated": false,
  "error": null
}

Fields:
- installed -- true if the runtime is installed and the service is enabled
- detected_backend -- auto-detected GPU backend: "cuda", "rocm", "cpu", or null if detection failed
- estimated_size_mb -- estimated download size from manifest for the detected backend; null if backend unknown
- cache_available -- true if verified wheel cache exists (faster re-install)
- installing -- true if an install is in progress (request marker or active progress)
- manifest_populated -- true if the runtime manifest has real package hashes. If false, enable returns 503. Run scripts/refresh-diffusion-locks.sh to populate.
- error -- error detail from the last failed install, or null
Status priority: installed > failed (suppresses installing) > in-progress > not installed

POST /api/diffusion/runtime/enable

Request diffusion runtime installation.

Response: 202 Accepted -- install requested
```
{ "status": "installing" }
```
Response: 200 OK -- already installed
```
{ "status": "already_installed" }
```
Error: 409 Conflict -- install already in progress
```
{ "status": "already_installing" }
```
Notes: Does not directly run the installer. Atomically creates a request marker file (O_CREAT|O_EXCL, mode 0600). A systemd path unit detects the marker and starts the privileged oneshot installer.

GET /api/diffusion/runtime/progress

Return current install progress from the installer's progress file.

Response: 200 OK

{
  "phase": "downloading",
  "percent": 45,
  "backend": "cuda",
  "detail": "Downloading torch-2.3.1+cu121...",
  "total_packages": 42,
  "downloaded": 19,
  "verified": 19,
  "cached_hits": 5,
  "error": null
}

Valid phases: detecting, downloading, verifying, installing, smoke_testing, enabling, complete, failed, or null
Notes: Only meaningful after POST /api/diffusion/runtime/enable has been called. When no install has ever been requested, returns phase: null. When the install completed, returns complete; when it failed, returns failed. Invalid phases from the progress file are normalized to failed. All branches return the same field set.

Vault Management

GET /api/vault/status

Get the current vault lock/unlock status.

Response: 200 OK

{
  "status": "unlocked",
  "locked_at": null,
  "auto_lock_minutes": 15
}

POST /api/vault/lock

Lock the encrypted vault immediately.

Response: 200 OK -- vault locked

POST /api/vault/unlock

Unlock the encrypted vault with a passphrase.

Request body:
```
{
  "passphrase": "user-passphrase"
}
```
Response: 200 OK -- vault unlocked
Error: 401 Unauthorized -- incorrect passphrase
Error: 429 Too Many Requests -- rate limited after failed attempts

POST /api/vault/keepalive

Reset the vault auto-lock idle timer.

Response: 200 OK -- timer reset

Emergency

POST /api/emergency/panic

Trigger an emergency panic action (locks vault, optionally shuts down).

Response: 200 OK -- panic action executed

Updates

GET /api/updates/check

Check for available OS updates.

Response: 200 OK

{
  "available": true,
  "version": "42.20260308",
  "changelog": "..."
}

POST /api/updates/stage

Download and stage an update without applying it.

Response: 200 OK -- update staged

POST /api/updates/apply

Apply a staged update (requires reboot).

Response: 200 OK -- update applied, reboot required

POST /api/updates/rollback

Roll back to the previous OS deployment.

Response: 200 OK -- rollback staged, reboot required

Hardware

POST /api/vm/gpu

Get GPU information and status.

Response: 200 OK

{
  "detected": true,
  "type": "nvidia",
  "name": "NVIDIA RTX 5080",
  "vram_mb": 16384
}

Security

GET /api/security/status

Get the overall security status of the appliance.

Response: 200 OK

{
  "vault_status": "unlocked",
  "firewall_active": true,
  "airlock_enabled": false,
  "search_enabled": false,
  "integrity_ok": true,
  "last_integrity_check": "2026-03-08T12:00:00Z"
}

Search Mediator API (port 8485)

POST /search

Submit a sanitized web search query.

Request body:

{
  "query": "search terms",
  "max_results": 5
}

Response: 200 OK

{
  "results": [
    {
      "title": "Page Title",
      "url": "https://example.com",
      "snippet": "Relevant text excerpt..."
    }
  ],
  "query_sanitized": true,
  "results_filtered": 0
}

Error: 503 Service Unavailable -- Search Mediator is disabled

FilesExpand file tree

api.md

Latest commit

History

api.md

File metadata and controls

HTTP API Reference

Registry API (port 8470)

GET /v1/models

GET /v1/model/{name}

POST /v1/promote

DELETE /v1/model/{name}

POST /v1/model/verify-manifest

Tool Firewall API (port 8475)

POST /v1/tool/invoke

Airlock API (port 8490)

POST /v1/proxy

Agent API (port 8476)

POST /v1/task

GET /v1/task/{id}

POST /v1/task/{id}/approve

POST /v1/task/{id}/deny

POST /v1/task/{id}/cancel

GET /v1/tasks

GET /v1/modes

UI API (port 8480)

Model Management

GET /api/models

POST /api/models/download

POST /api/models/import

POST /api/models/verify-manifest

Chat and Generation

POST /api/chat

POST /api/generate

Diffusion Runtime (On-Demand Acquisition)

GET /api/diffusion/runtime/status

POST /api/diffusion/runtime/enable

GET /api/diffusion/runtime/progress

Vault Management

GET /api/vault/status

POST /api/vault/lock

POST /api/vault/unlock

POST /api/vault/keepalive

Emergency

POST /api/emergency/panic

Updates

GET /api/updates/check

POST /api/updates/stage

POST /api/updates/apply

POST /api/updates/rollback

Hardware

POST /api/vm/gpu

Security

GET /api/security/status

Search Mediator API (port 8485)

POST /search