feat(agents): add manifest-driven gator agent by johntmyers · Pull Request #1826 · NVIDIA/OpenShell

johntmyers · 2026-06-09T05:15:13Z

Summary

Add a manifest-driven sandbox agent launcher and the first concrete agent, Gator, for validating and monitoring OpenShell issues and PRs. The launcher packages shared runtime adapters, provider profiles, skills, subagents, and prompts into supervised OpenShell sandboxes so gator can run bounded Codex cycles in watch mode.

Related Issue

None.

Changes

Add openshell-agents/run.sh, shared runtime entrypoint, supervisor, Codex harness adapter, and subagent dispatch support.
Add the openshell-agents/gator/ manifest, prompt, README, Dockerfile, scoped GitHub/Codex provider profiles, and sandbox policy.
Add the gator-gate skill for the gator state machine, validation rules, human-response disposition, reviewer invocation, CI/test gating, and scoped GitHub auth guidance.
Add gateway-managed Codex access-token refresh handling and preserve/repair behavior for long-running watch sandboxes.
Harden watch mode so transient transport failures, malformed cycle results, and legacy failure statuses retry without exiting the sandbox.
Document the agent runtime and provider placeholder behavior in openshell-agents/README.md.
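
The watch-mode hardening described in the list above can be sketched as a loop that classifies each cycle result and retries instead of exiting. This is a minimal illustration, not the supervisor added in this PR: the function names `classify_cycle_result` and `watch_loop`, the status strings, and the bounded cycle count are all assumptions made for the example.

```shell
# Hypothetical sketch: decide what the watch loop does after one cycle.
# The status names are illustrative assumptions, not taken from the PR.
classify_cycle_result() {
  case "$1" in
    ok|idle)                      echo "continue" ;;  # normal cycle, keep watching
    transport-error|failed|error) echo "retry" ;;     # transient or legacy failure
    *)                            echo "retry" ;;     # malformed or unknown result
  esac
}

# Run $1 cycles of the command given in the remaining arguments.
# A failing cycle never ends the loop; it is recorded and the loop moves on.
# (A real watch loop would run indefinitely; the bound keeps the sketch testable.)
watch_loop() {
  max_cycles="$1"; shift
  i=0
  while [ "$i" -lt "$max_cycles" ]; do
    # A non-zero exit from the cycle command is treated as a transport failure.
    status="$("$@" 2>/dev/null)" || status="transport-error"
    printf 'cycle %s: %s -> %s\n' "$i" "${status:-empty}" "$(classify_cycle_result "$status")"
    i=$((i + 1))
  done
}
```

For example, `watch_loop 2 false` reports two retried cycles rather than stopping after the first failure.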

Testing

mise run pre-commit passes
Unit tests added/updated
E2E tests added/updated (if applicable)

Additional checks:

bash openshell-agents/runtime/supervisor_test.sh
Live gator sandboxes relaunched and observed Ready on docker-dev
Verified provider-backed Codex and GitHub placeholder resolution inside gator sandboxes

Checklist

Follows Conventional Commits
Commits are signed off (DCO)
Architecture docs updated (not applicable; agent docs live under openshell-agents/)

elezar · 2026-06-10T12:02:22Z

Question: Is the expectation that each agent will have its own Dockerfile? If so, does it make sense to move this to openshell-agents/gator/Dockerfile instead? Alternatively, we may need to update the README.md to show an (optional?) Dockerfile.agent.

elezar · 2026-06-10T12:58:34Z

+MANIFEST_FILE="$AGENT_DIR/agent.yaml"
+[[ -f "$MANIFEST_FILE" ]] || fail "missing agent manifest: $MANIFEST_FILE"
+
+require_cmd ruby


On the systems that I have set up, I don't have ruby installed and don't get past this requirement. How can we increase the portability of this script? Would adding a new openshell subcommand be too heavy-weight here?

From Codex:

Why does the openshell-agents/run.sh command require ruby?

openshell-agents/run.sh requires Ruby because the Bash launcher uses embedded Ruby snippets for structured host-side work. It explicitly checks for ruby at openshell-agents/run.sh:169. Ruby is used to:

- Parse agent.yaml and emit shell-safe variables with YAML + Shellwords: openshell-agents/run.sh:178
- Resolve provider profile YAML files and detect duplicate profile IDs: openshell-agents/run.sh:339
- Read nested values from JSON credential files like ~/.codex/auth.json: openshell-agents/run.sh:399
- Render the prompt template into agent-prompt.md: openshell-agents/run.sh:545
- Patch the staged Dockerfile to copy in the immutable agent payload while preserving the final USER: openshell-agents/run.sh:609

So Ruby is not required by the agent runtime or Codex harness itself. It is a host-side implementation dependency of the launcher. It also appears to be undeclared in mise.toml, so right now it is a hidden local prerequisite unless installed by the OS or user environment.
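
For context, the `require_cmd ruby` line in the quoted diff is a preflight guard. The bodies of `fail` and `require_cmd` are not shown in the diff, so the following is only a sketch of how such helpers are commonly written; the bodies are assumptions, and only the two names come from the PR.

```shell
# Hypothetical helper bodies; only the names `fail` and `require_cmd`
# appear in the quoted diff.
fail() {
  printf 'error: %s\n' "$*" >&2
  # A launcher script would more likely `exit 1` here; `return 1` keeps
  # this sketch safe to source into an interactive shell.
  return 1
}

require_cmd() {
  # `command -v` is the POSIX way to test whether a command is on PATH.
  command -v "$1" >/dev/null 2>&1 || fail "required command not found: $1"
}
```

With these definitions, `require_cmd ruby` prints an error and returns non-zero on a host without Ruby, which matches the failure described in the comment above.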

elezar · 2026-06-10T17:30:45Z

While exercising the gator state machine on #1850, I hit one transition that is worth clarifying in this PR.

When a PR is in gator:approval-needed and a maintainer approval arrives, gator currently re-checks the PR and stays in gator:approval-needed until the PR is merged or closed. Mechanically that is fine, but the label reads as if approval is still missing after approval has already been received.

Can we make this explicit in the gator support added here? Two reasonable options:

Document/comment-template wording: after approval, gator:approval-needed means "maintainer merge decision pending" for an open PR.
Add a distinct state such as gator:merge-ready or gator:merge-pending so the post-approval state is not overloaded.

The current behavior is not blocking #1850, but without this clarification users will reasonably ask why an approved PR still says approval-needed.
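
The second option above can be sketched as a small transition function. This is an illustration of the proposal only, not gator's actual state machine: the function name `next_gator_label` and the event names `approved` and `recheck` are assumptions, and only events on an open PR are modeled (merge and close handling is left out).

```shell
# Hypothetical sketch of a distinct post-approval label.
# `gator:merge-pending` is one of the names proposed in the comment above;
# the event names are assumptions made for this example.
next_gator_label() {
  current="$1"
  event="$2"
  case "$current:$event" in
    gator:approval-needed:approved) echo "gator:merge-pending" ;;    # approval received
    gator:approval-needed:*)        echo "gator:approval-needed" ;;  # still waiting
    gator:merge-pending:*)          echo "gator:merge-pending" ;;    # awaiting merge decision
    *)                              echo "$current" ;;               # unrelated labels unchanged
  esac
}
```

Under this sketch, a re-check of an approved PR leaves it in `gator:merge-pending`, so the label no longer suggests that approval is missing.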

copy-pr-bot · 2026-06-10T19:46:11Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

johntmyers added 16 commits June 5, 2026 14:34

chore(gator): add gator gate skill

91fd9d4

chore(gator): add sandbox launcher scaffold

1a21bba

chore(gator): add codex image and docs checks

ba80e91

chore(gator): fold approved provider policy rules

9825e16

chore(gator): add deterministic reviewer runner

567bc88

chore(gator): clarify ok-to-test comments

3dd6607

chore(gator): structure launcher harnesses

c3066ac

chore(gator): require e2e for dependabot

9141c1b

chore(gator): add codex refresh profile

b875880

chore(gator): wip manifest agent launcher

c7306cd

feat(agents): supervise watch cycles in sandbox

d810646

fix(agents): preserve gateway refresh state

3b111f1

fix(gator): continue human response threads

1057af2

fix(agents): keep watch supervisor retrying

8b83535

fix(agents): use refreshed Codex credential aliases

6846b3b

fix(gator): avoid misleading gh auth checks

34c571e

johntmyers requested a review from a team as a code owner June 9, 2026 05:15

johntmyers added 3 commits June 8, 2026 22:23

docs(agents): remove architecture build update

10bc74a

fix(gator): use REST-backed GitHub writes

87b2a10

fix(agents): bake immutable agent payloads

c479d52

johntmyers requested review from derekwaynecarr and mrunalp as code owners June 9, 2026 15:09

fix(agents): upload writable agent workspace

7c3a2eb

elezar reviewed Jun 10, 2026

elezar added 5 commits June 10, 2026 15:49

fix(agents): surface gator watch progress

b20fa3f

fix(agents): prevent codex stdin hang

be52f1c

fix(agents): align codex subagent input

40c2314

fix(agents): heartbeat during active cycles

2236088

fix(agents): clean up heartbeat sleep

d97bfb9

fix(agents): disable gh telemetry in codex harness

6a4d720

elezar added 2 commits June 10, 2026 22:32

fix(agents): reconcile closed gator PRs

b886cdf

fix(agents): query closed gator PR labels separately

9ec28c8

johntmyers wants to merge 28 commits into main from feat/gator-gate-skill

2 participants
