docs(rfc): SDK design principles and consumption entrypoints by zanetworker · Pull Request #1590 · NVIDIA/OpenShell

zanetworker · 2026-05-27T11:45:46Z

Related: #1044

Summary

RFC 0006 proposing official Python and TypeScript SDKs that make OpenShell consumable as programmable infrastructure for agent platforms and frameworks, plus streaming file transfer RPCs.

Why: Agent platforms (Anthropic Managed Agents, OpenAI Agents SDK, OpenClaw, Cloudflare) all need a secure execution layer. OpenShell has the enforcement (Landlock, seccomp, L4/L7 policy, credential injection, OCSF audit) and the API (54 gRPC RPCs). But only 8 RPCs are wrapped in the Python SDK, there's no TypeScript SDK, no file transfer RPC, and no OIDC auth in the SDK. Every integration must shell out to the CLI binary or build a custom gRPC client.

What this RFC proposes:

Extend the Python SDK with OIDC auth, provider attach/detach, streaming watch, and file transfer
Add streaming UploadFile/DownloadFile gRPC RPCs to the gateway (routed via existing RelayStream infrastructure)
Ship an official TypeScript SDK (for OpenClaw and Node.js frameworks)
OIDC authentication in both SDKs for cross-namespace K8s deployments
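
For illustration only, the client side of a streaming upload could be shaped roughly as below: a header message followed by fixed-size data chunks. This is a minimal sketch under assumptions, not the proposed proto. `UploadChunk`, `upload_requests`, `reassemble`, the field names, and the 64 KiB chunk size are all hypothetical and do not come from the RFC.

```python
import io
from dataclasses import dataclass
from typing import BinaryIO, Iterable, Iterator, Optional

CHUNK_SIZE = 64 * 1024  # assumed chunk size; the RFC does not specify one


@dataclass
class UploadChunk:
    """Hypothetical request message; the real proto may look different."""
    sandbox_id: Optional[str] = None  # set only on the first (header) message
    dest_path: Optional[str] = None   # set only on the first (header) message
    data: bytes = b""


def upload_requests(sandbox_id: str, dest_path: str, src: BinaryIO,
                    chunk_size: int = CHUNK_SIZE) -> Iterator[UploadChunk]:
    """Yield one header message, then data chunks until the source hits EOF."""
    yield UploadChunk(sandbox_id=sandbox_id, dest_path=dest_path)
    while True:
        block = src.read(chunk_size)
        if not block:
            break
        yield UploadChunk(data=block)


def reassemble(chunks: Iterable[UploadChunk]) -> bytes:
    """Receiver side of the sketch: concatenate every payload in order."""
    return b"".join(chunk.data for chunk in chunks)
```

Because the generator reads the source lazily, memory use stays bounded by the chunk size regardless of file size, which is the main reason to prefer a client-streaming RPC over a single unary request.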

Three sandbox modes covered:

Mode 1 (entire agent sandboxed): CLI-driven, no SDK needed
Mode 2 (platform-managed): SDK in your worker, brain on the platform (Anthropic, OpenAI Responses API)
Mode 3 (framework extension): SDK embedded in the developer's process (OpenAI Agents SDK, OpenClaw)

Five implementation phases with a dependency analysis showing Phase 1 (OIDC + providers) and Phase 2 (file transfer) can run in parallel.

Related Issues

Extends the Python SDK at python/openshell/sandbox.py
Builds on PR #1404 "feat(auth): per-sandbox authentication to gateway" (per-sandbox auth, merged) and PR #935 "feat(auth): add OIDC/Keycloak authentication with RBAC and scope-based permissions" (OIDC gateway support, merged)
Coordinates with PR #1547 "fix(python): raise SandboxError instead of FileNotFoundError or KeyError" (Python SDK fixes, open) and PR #1117 "build(python): publish manylinux_2_28 wheels for broader glibc compat…" (Python wheels, open)

Changes

rfc/0006-sdk-and-file-transfer/README.md - Full RFC document
rfc/0006-sdk-and-file-transfer/sdk-modes.png - Three sandbox modes architecture diagram
rfc/0006-sdk-and-file-transfer/sdk-phase-deps.{mmd,png} - Phase dependency diagram
rfc/0006-sdk-and-file-transfer/sdk-file-transfer.png - UploadFile/DownloadFile sequence diagram
rfc/0006-sdk-and-file-transfer/sdk-anthropic-worker.png - Anthropic Mode 2 end-to-end sequence

Testing

mise run pre-commit passes
Markdown lint clean
All diagrams render correctly

Checklist

Follows Conventional Commits
Commits are signed off (DCO)
RFC follows the rfc/0000-template structure
RFC number (0006) does not conflict with existing or in-flight RFCs

Add RFC 0006 proposing official Python and TypeScript SDKs for programmatic sandbox consumption by agent platforms and frameworks.

The RFC covers:

- Three sandbox modes and which the SDK serves (Mode 2 and 3)
- Extending the Python SDK with OIDC auth, provider management, streaming watch, and file transfer
- Streaming UploadFile/DownloadFile gRPC RPCs for the gateway
- A new TypeScript SDK for OpenClaw and Node.js frameworks
- Five implementation phases with dependency analysis
- Integration examples for Anthropic Managed Agents and OpenAI Agents SDK

Includes architecture diagrams for the three modes, file transfer sequence flow, phase dependencies, and Anthropic worker end-to-end.

Signed-off-by: Adel Zaalouk <azaalouk@redhat.com>

copy-pr-bot · 2026-05-27T11:45:50Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

derekwaynecarr · 2026-05-28T13:11:45Z

+frameworks. Add streaming UploadFile/DownloadFile gRPC RPCs to the
+gateway so SDK consumers can move files in and out of sandboxes
+without shelling out to the CLI. Support OIDC authentication in both
+SDKs so cross-namespace K8s deployments work without copying mTLS


can you clarify what is meant by cross namespace deployments?

Good call. "Cross-namespace" is K8s jargon that does not belong here. What I meant: the current Python SDK only supports mTLS, which requires distributing TLS client certificates to every consumer. In a K8s deployment that means copying Secrets across namespaces, but the same friction applies outside K8s — any consumer on a different machine needs those certs.

OIDC removes that distribution problem regardless of deployment model. I will rewrite this section to frame it as "SDK consumers need bearer-token auth so they can connect to any OIDC-enabled gateway without distributing client certificates."

derekwaynecarr · 2026-05-28T13:16:46Z

+
+| Method | RPC | Why |
+|--------|-----|-----|
+| OIDC auth | gRPC metadata interceptor | mTLS-only locks SDK to single namespace. Every K8s production deployment needs cross-namespace auth. |


I don't understand what OIDC has to do with K8s. Agreed that the SDK needs to work with a server that uses OIDC auth.

Agreed — OIDC has nothing to do with K8s specifically. The SDK needs to work with any gateway that has OIDC auth enabled, whether that is on K8s, bare metal, or a managed service. I conflated "the most common deployment where this matters" with "the reason it matters." Will decouple the two in the next revision.
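
As a rough sketch of what bearer-token auth via gRPC call metadata could look like on the client: the class below attaches an `authorization: Bearer <token>` header to each call. `BearerTokenPlugin` and `token_source` are hypothetical names, not part of the existing Python SDK. The `__call__(context, callback)` shape mirrors `grpc.AuthMetadataPlugin`, but the block avoids importing grpcio so it runs standalone.

```python
from typing import Callable, Optional, Sequence, Tuple

Metadata = Sequence[Tuple[str, str]]


class BearerTokenPlugin:
    """Adds an `authorization: Bearer <token>` entry to each call's metadata.

    The __call__ signature mirrors grpc.AuthMetadataPlugin, so with grpcio
    installed this class could subclass it and be wrapped with
    grpc.metadata_call_credentials(...).
    """

    def __init__(self, token_source: Callable[[], str]) -> None:
        # token_source is a hypothetical hook that returns a current token,
        # for example from a cached OIDC client-credentials exchange.
        self._token_source = token_source

    def __call__(self, context: object,
                 callback: Callable[[Metadata, Optional[Exception]], None]) -> None:
        try:
            token = self._token_source()
        except Exception as exc:
            # Report the failure to the caller instead of sending an
            # unauthenticated request.
            callback((), exc)
            return
        callback((("authorization", f"Bearer {token}"),), None)
```

With grpcio, the resulting call credentials would typically be combined with TLS channel credentials via `grpc.composite_channel_credentials` and passed to `grpc.secure_channel`. Nothing here depends on where the gateway runs, which matches the point that OIDC support is independent of K8s.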

zanetworker · 2026-05-29T11:38:22Z

Positioning relative to RFC 0005 (#1617) and the Python SDK OIDC PR (#1621)

Three SDK efforts are in flight under #1044. Here is how they relate:

| PR | Scope | State |
|----|-------|-------|
| #1617 (RFC 0005, @maxdubrinsky) | Extract `openshell-sdk` Rust crate, ship `@openshell/sdk` TypeScript binding via napi-rs, refactor CLI onto SDK | RFC + working prototype |
| #1621 (@mrunalp) | Add OIDC bearer auth to the existing pure-Python SDK | Code, ready for review |
| #1590 (this RFC 0006) | Broader SDK roadmap: consumption patterns (Mode 1/2/3), file transfer RPCs, Python SDK surface expansion, platform integration examples (Anthropic, OpenAI, OpenClaw) | RFC, no code |

RFC 0005 and RFC 0006 are complementary, not competing. RFC 0005 delivers the shared Rust core and TypeScript binding — the "how" for the TS half. This RFC frames the "why" and "what" across both languages: which consumption patterns platforms need, what RPC gaps block adoption (file transfer), and how the SDK surface maps to real platform integrations.

Concretely, this RFC covers areas that RFC 0005 explicitly defers or does not address:

File transfer RPCs (UploadFile/DownloadFile): gateway proto changes, streaming design, routing via ConnectSupervisor/RelayStream
Python SDK surface expansion: provider attach/detach, watch, policy, services (wrapping existing RPCs, not rebuilding transport)
Platform consumption patterns: Anthropic Managed Agents (Mode 2), OpenAI Agents SDK (Mode 3), OpenClaw, CI/CD
Python-on-shared-core migration path: RFC 0005 defers this as a non-goal; this RFC scopes it as a future phase

The intended reading order: this RFC for the overall SDK strategy and surface area, RFC 0005 for the shared core architecture and TS binding implementation.

Will update the RFC text to reference RFC 0005 directly and to decouple OIDC from K8s framing per @derekwaynecarr feedback.

drew · 2026-06-02T05:06:20Z

+**Mode 1: Sandbox the entire agent.** The agent process runs inside
+the sandbox. Interface: CLI. No SDK needed.
+
+**Mode 2: Platform-managed sandbox.** The platform (Anthropic, OpenAI)
+owns the agent loop. A separate worker on your infrastructure embeds
+the SDK and creates sandboxes. Brain and worker are physically
+separate systems. Mode 2 is a spectrum:


I've heard this commonly described as: Agent in a Sandbox (mode 1) and Sandbox as a Tool (mode 2).

Reference: https://www.langchain.com/blog/the-two-patterns-by-which-agents-connect-sandboxes.

OpenShell has always intended to support both. If that analogy is similar to mode 1 and 2, I would suggest framing this RFC around those two terms. This is how I think we should document usage as well. "platform managed sandbox" seems ambiguous to me because "platform" in this case refers more concretely to "brain" and isn't necessarily specific to anthropic or openai, it could just as well be some agentic loop running on my laptop.

So here is the blind spot from my perspective.

Today we have agentic APIs such as OpenAI's Responses API, which exposes "containers" through the containers API. That API is usually another abstraction, and there are now open implementations of those APIs. For example, see ogx-ai/ogx#5892.

An SDK usually has laxer boundaries: not APIs that we need to "conform" to or implement, but code interfaces that we can plug into via libraries. See API implementation vs. framework invocation.

It really is "who owns the implementation": in Mode 2 it's the platform provider/API provider; in Mode 3 it's the developer or the framework abstraction. Does that make sense?

That said, I think we can simplify. Basically, we could say, we don't care "who" owns the implementation or what "manages" it, from our end we just need to enable "sandbox as a tool" and it should work with both. The only tradeoff is that we lose visibility into invocations that happen via mode 2 from an integration standpoint.

Wdyt?

> It really is "who owns the implementation": in Mode 2 it's the platform provider/API provider; in Mode 3 it's the developer or the framework abstraction. Does that make sense?

In mode 2 and 3 the same OpenShell SDKs and APIs are getting used, right?

Yes. If we focus on how the SDKs/APIs are used (the receiver), it boils down to two modes, since Modes 2 and 3 use the same APIs/SDKs. If we also consider the invoker, the difference is one of persona and ownership. The separation between Mode 2 and Mode 3 was intended to express that distinction:

- The platform controls how OpenShell's SDKs/APIs are called; the developer just consumes an API abstraction such as the Responses API or containers API.
- The developer or framework has to integrate with or call OpenShell themselves.

The idea was that knowing how or where those calls are invoked maps to where we would make a change for OpenShell compatibility: contribute to the implementation of those APIs, contribute at the framework level, or contribute nothing at all if there are no abstractions beyond the ones we add with the SDK.

I will reduce this to two modes, and mention the consumption entrypoints in the text to keep it simple.
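
To make the "sandbox as a tool" pattern concrete, here is a minimal sketch in which a tool handler forwards a command to a sandbox and returns the result to whatever agent loop called it. Everything here is hypothetical: `SandboxExecutor`, `ExecResult`, `FakeSandbox`, and `run_command_tool` are illustrative names rather than the OpenShell SDK surface, and the fake backend exists only so the sketch runs without a gateway.

```python
from dataclasses import dataclass
from typing import Dict, List, Protocol


@dataclass
class ExecResult:
    exit_code: int
    stdout: str


class SandboxExecutor(Protocol):
    """Hypothetical minimal surface; not the actual OpenShell SDK API."""

    def exec(self, sandbox_id: str, argv: List[str]) -> ExecResult: ...


class FakeSandbox:
    """In-memory stand-in that only understands `echo`."""

    def exec(self, sandbox_id: str, argv: List[str]) -> ExecResult:
        if argv[:1] == ["echo"]:
            return ExecResult(0, " ".join(argv[1:]) + "\n")
        return ExecResult(127, "")


def run_command_tool(client: SandboxExecutor, sandbox_id: str,
                     tool_input: Dict[str, List[str]]) -> Dict[str, object]:
    """Tool handler: the agent loop runs elsewhere and only sees this result."""
    result = client.exec(sandbox_id, tool_input["argv"])
    return {"exit_code": result.exit_code, "stdout": result.stdout}
```

The handler is the same whether a platform worker, a framework adapter, or a direct caller invokes it, which is the sense in which the consumption entrypoints are variants of one pattern.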

drew · 2026-06-02T05:08:53Z

+
+### Three sandbox modes
+
+The SDK serves Mode 2 and Mode 3. Mode 1 stays CLI-driven.


nit: I think we still want to programmatically launch agents in a sandbox from the SDK. CLI and SDK should generally have API parity outside of things like interactive shell access. For example, I think we also want the policy prover in our SDK.

drew · 2026-06-02T05:11:49Z

+- **Provider CRUD in the SDK.** Providers are created by the platform
+  engineer via CLI. SDK consumers attach existing providers, not
+  create new ones.


Any reason not to include this? I would expect to have api parity between the CLI and SDK.

Yeah, I added that as an open question (number 4):

Provider CRUD in SDK. This RFC scopes the SDK to attach/detach (bind existing providers). Should full provider CRUD (create/update/delete) be in scope for platforms that want fully programmatic provider lifecycle?

This is mostly about personas and concerns decoupling. I.e., do we want developers managing providers, or do we want to keep that gated (e.g., with admin OIDC role) for those permissions, especially in production setups where an instance is shared?

My sense is that it should be possible to lock down provider access, but since many providers are tied to individuals (e.g., GitHub), we should probably assume they are used across many personas.

Similar philosophy to kube. The entire API surface area is available in the various SDK/CLI interfaces, but access is locked down based on each deployment.
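
One way to picture "full API surface, access locked down per deployment" is a role-to-verb check applied before any provider operation. This is only a sketch under assumptions: the role names, verbs, `ROLE_VERBS`, and `authorize_provider_op` are hypothetical and not taken from the gateway's RBAC implementation.

```python
from typing import Dict, FrozenSet

# Hypothetical role-to-verb mapping that a deployment could configure.
ROLE_VERBS: Dict[str, FrozenSet[str]] = {
    "admin": frozenset({"create", "update", "delete", "attach", "detach"}),
    "developer": frozenset({"attach", "detach"}),
}


class PermissionDenied(Exception):
    """Raised when none of the caller's roles grants the requested verb."""


def authorize_provider_op(roles: FrozenSet[str], verb: str) -> None:
    """Return silently if any role grants the verb, otherwise raise."""
    if not any(verb in ROLE_VERBS.get(role, frozenset()) for role in roles):
        raise PermissionDenied(
            f"provider {verb} not permitted for roles {sorted(roles)}"
        )
```

In this model the SDK exposes create/update/delete to everyone, and a shared production deployment narrows the mapping rather than shipping a smaller client.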

makes sense.

drew · 2026-06-02T05:38:01Z

+| Anthropic worker | Create sandboxes, download skills, run tool calls, retrieve artifacts | No OIDC auth, no file transfer RPC |
+| OpenAI Agents SDK adapter | Implement SandboxClient: materialize Manifest, exec, snapshot | No file transfer RPC (session.write() for LocalDir has no clean implementation) |
+| OpenClaw plugin | Create sandboxes, sync workspace, exec commands | No TypeScript SDK (plugins are TS-only), currently shells out to CLI 5+ times per command |
+| Multi-tenant platform | Per-tenant sandboxes with policies and credentials | No OIDC auth, no provider attach/detach in SDK |


OIDC auth in OpenShell's current form != multi-tenancy right? For example, if you and I both authenticate against a gateway and create sandboxes, we can see each other's work.

Agreed, I will clarify this. That said, it is probably worth exploring multi-tenancy in a separate RFC?

+1 this seems important to start thinking about. I added a roadmap item for it here, #1722.
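
The gap between identity and isolation can be shown with a small sketch: authentication tells the gateway who is calling, but isolation only appears once results are filtered by that identity. The `Sandbox.owner` field and `list_visible` function are hypothetical and serve only to illustrate the distinction.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Sandbox:
    name: str
    owner: str  # hypothetical field: the OIDC subject that created the sandbox


def list_visible(sandboxes: List[Sandbox], principal: str, scoped: bool) -> List[Sandbox]:
    """With scoped=False every authenticated caller sees every sandbox."""
    if not scoped:
        return list(sandboxes)
    return [s for s in sandboxes if s.owner == principal]
```

The `scoped=False` branch corresponds to the behavior described above, where two authenticated users can see each other's sandboxes; the `scoped=True` branch is the kind of per-principal scoping that would need separate gateway work.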

drew · 2026-06-02T05:46:17Z

+  - "Issue #1044 (SDK roadmap)"
+---
+
+# RFC 0006 - SDK Consumption Entrypoints and File Transfer


nit: sdk design principles and streaming file transfers seem like two distinct things. is there a reason these are linked together? can we decouple the efforts?

I can decouple the title. File transfer is a strong dependency, so I'm happy to create a separate issue for it if that makes sense. I was thinking of this RFC as the spec describing where we want to go and what dependencies we have to get there, and file transfer was one of them.

Makes sense. I think this RFC can identify the need and motivation for streaming file transfer, but we can defer its design to a separate effort, since it's such a big component that it could stand on its own.

- Move provider CRUD from non-goals to SDK scope (API parity)
- Clarify OIDC gives identity, not tenant isolation; per-principal sandbox scoping is separate gateway work
- Reference NVIDIA#1617 (shared Rust core) in Risks section
- Move TS SDK location and API parity to resolved questions
- Add NVIDIA#1617 relationship as open question

Signed-off-by: Adel Zaalouk <azaalouk@redhat.com>

- Consolidate Mode 2 and Mode 3 into "Sandbox as a Tool" pattern, aligned with LangChain framing (Agent in Sandbox vs Sandbox as Tool)
- Show consumption entrypoints (platform, framework, direct) as variants of the same pattern, not separate modes
- Defer file transfer proto design to NVIDIA#1707; keep motivation and need in this RFC
- Update diagram to two-pattern layout
- Remove resolved questions section (decisions already in proposal)
- Add relationship to RFC 0005 (NVIDIA#1617) and link new related PRs

Signed-off-by: Adel Zaalouk <azaalouk@redhat.com>

zanetworker requested review from a team, derekwaynecarr, maxamillion and mrunalp as code owners May 27, 2026 11:45

derekwaynecarr reviewed May 28, 2026


drew reviewed Jun 2, 2026


zanetworker force-pushed the docs/sdk-file-transfer-rfc branch from b57ca6a to fe6e86d Compare June 2, 2026 11:29

zanetworker changed the title ~~docs(rfc): propose SDK consumption entrypoints and file transfer~~ docs(rfc): SDK design principles and consumption entrypoints Jun 2, 2026

pimlock added the rfc label Jun 5, 2026

