OpenClaw Security Guardrail

Zero-Trust security sandbox for OpenClaw agents. Every agent output is treated as hostile; validation and sandboxing are deterministic and local-first.

Quick Start

npm install
npm run build
npm run test
Use createSecurityGuardrail() to get authority, schema, sandbox, trust.

import { createSecurityGuardrail } from "openclaw-security-guardrail";
import { z } from "zod";

const guardrail = createSecurityGuardrail();
await guardrail.trust.load();

const context = { agentId: "a1", tenantId: "t1", userTier: "pro", skill: "scan" };
guardrail.authority.enforceTier(context);

const safe = guardrail.authority.sanitize({ result: "ok", overrideScore: 99 }).sanitized;
const schema = z.object({ result: z.string(), tenantId: z.string() });
guardrail.schema.validate({ result: "ok", tenantId: "t1" }, schema);

await guardrail.sandbox.execute(async (signal) => {
  if (signal.aborted) throw new DOMException("Aborted", "AbortError");
  return "done";
});

Security Policies

Skill whitelist per tier is configured in src/config/security.config.ts.
Resource limits (timeout 30s, 512MB RAM, 50% CPU best-effort, max payload 2MB).
Forbidden overrides: overrideScore, userTier, eligibility, newBudget are stripped.
Schema enforcement: strict JSON validation with payload size limit and sensitive-field detection.

Tenant Isolation

Always include tenantId in agent context and outputs; AuthorityGuard.enforceTenant rejects cross-tenant data.
Cross-tenant leak test lives in tests/adversarial/cross-tenant.test.ts.
Single-tenant use: set a fixed tenantId (e.g., "default") but keep the check enabled to catch accidental mix-ups.

User Tiers

userTier drives the skill whitelist in security.config.ts; agent calls are blocked if the skill is not allowed for that tier.
Guardrail strips any userTier the agent tries to return; only the platform sets it.
Single-user mode: set userTier: "single" (or any label) and configure the corresponding whitelist; keep enforcement on to prevent privilege creep.

Developer Guide

Always pass AgentContext with tenantId, userTier, skill.
Wrap every agent call with guardLlmCall or manually chain:
- authority.enforceTier(context)
- authority.sanitize(output)
- schema.validate(output, schema)
- sandbox.execute(fn, limits) for skill execution
- trust.recordResult(agentId, success)
Add per-model pricing or trust weights in config as needed.

Folder Layout

src/security: AuthorityGuard, SchemaGuard, SandboxWrapper, TrustEngine
src/domain: SecurityPolicy, AgentContext
src/config: security.config.ts
tests/adversarial: override, cross-tenant, resource exhaustion, schema rejection
tests/unit: trust engine persistence

Notes

Local-first: uses JSON files for trust store, no external infra required.
Deterministic: no AI heuristics; all checks are rule-based.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
examples		examples
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenClaw Security Guardrail

Quick Start

Security Policies

Tenant Isolation

User Tiers

Developer Guide

Folder Layout

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OpenClaw Security Guardrail

Quick Start

Security Policies

Tenant Isolation

User Tiers

Developer Guide

Folder Layout

Notes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages