Add subagent work units across providers by juliusmarminge · Pull Request #1199 · pingdotgg/t3code

juliusmarminge · 2026-03-18T22:55:25Z

⚠️ EARLY VIBED DO NOT USE

This stacked draft PR builds on split-stack/claude-sibling and introduces a generic turn-scoped work-unit model for subagent execution.

Today our orchestration model treats a turn as the user-visible response boundary, but it did not have a first-class way to represent delegated or nested execution inside that turn. That left us flattening provider-specific subagent behavior into raw activity text or, in Codex's case, risking child-conversation state being mistaken for top-level turn state. The user-visible consequence was that plan/progress state and future subagent UI work had no durable, provider-agnostic execution model to build on.

The core change in this PR is a new WorkUnitId domain entity and a new orchestration work-unit shape owned by a turn. Activities can now optionally reference a workUnitId, and threads now expose a workUnits collection alongside messages, plans, checkpoints, and activities. This keeps turns as the top-level interaction boundary while giving us a stable execution tree for delegated work. Historical compatibility is preserved by decoding missing workUnits and missing activity workUnitId values to empty/null defaults.

On the persistence side, this PR adds a dedicated projection_thread_work_units table plus a migration that backfills one root work unit per historical turn and links historical activities to that root. Projection write/read paths were updated so work units are projected, reverted, and hydrated as first-class read-model state instead of being reconstructed ad hoc from activity payloads.

On the runtime-ingestion side, provider runtime events now produce work units generically. A turn creates or updates a root primary_agent work unit, and task.started / task.progress / task.completed create or update delegated delegated_agent work units under that root. The delegated work unit keeps runtimeTaskId, and now also preserves the runtime item provenance when the provider can identify the spawning subagent tool item.

Both providers are now mapped into that generic shape. Claude task telemetry is correlated back to the Task tool's tool_use_id, so later Claude task.* events inherit the same subagent item identity. Codex child-conversation task events now inherit the parent collaboration tool item from the manager's receiver-thread routing, so flattened child task progress no longer loses the parent subagent provenance. This still keeps everything on the parent turn and thread timeline for now, but the data model is ready for richer subagent UI later.

I also updated the small fixture surfaces that construct orchestration threads/activities directly so they carry the new required fields, and added regression coverage across contracts, projector, projection pipeline, snapshot hydration, Codex manager routing, both adapters, and runtime ingestion.

Validation run for this branch:

bun fmt
bun lint
bun typecheck
cd packages/contracts && bun run test src/orchestration.test.ts
cd apps/server && bun run test src/orchestration/projector.test.ts
cd apps/server && bun run test src/orchestration/Layers/ProjectionPipeline.test.ts
cd apps/server && bun run test src/orchestration/Layers/ProjectionSnapshotQuery.test.ts
cd apps/server && bun run test src/orchestration/Layers/ProviderRuntimeIngestion.test.ts
cd apps/server && bun run test src/codexAppServerManager.test.ts
cd apps/server && bun run test src/provider/Layers/CodexAdapter.test.ts
cd apps/server && bun run test src/provider/Layers/ClaudeAdapter.test.ts

Note

Add subagent work units tracking across providers and projection pipeline

Introduces OrchestrationWorkUnit as a first-class contract entity with kinds (primary_agent, delegated_agent), states, and provider refs; adds a thread.work-unit.upsert command and thread.work-unit-upserted event to the orchestration system.
Extends ProviderRuntimeIngestion.ts to create and update work units as provider runtime events are processed: a root primary_agent work unit per turn, and child delegated_agent work units for each task.started/task.progress/task.completed event.
Adds a projection_thread_work_units table (migration 016) and a new projector in ProjectionPipeline.ts that persists work units and prunes them on thread.reverted.
Links activities to their work unit via a new work_unit_id column on projection_thread_activities, populated during upsert and exposed in snapshot queries.
Enriches the Codex adapter to route child collab conversation notifications onto the parent turn/item and suppress child lifecycle events from affecting session state.
Adds itemId/providerRefs linkage on Claude task.* and tool.progress events, and surfaces ExitPlanMode tool calls as turn.proposed.completed events (denied via canUseTool).
Risk: getSnapshot now queries an additional table and work unit updatedAt can advance the snapshot's overall updatedAt timestamp.

📊 Macroscope summarized e6d856b. 37 files reviewed, 9 issues evaluated, 1 issue filtered, 4 comments posted

🗂️ Filtered Issues

apps/server/src/orchestration/Layers/ProviderCommandReactor.test.ts — 0 comments posted, 1 evaluated, 1 filtered

line 220: The outer let runtime variable declared at line 64 is never assigned because createHarness declares a local const runtime at line 220 that shadows it. This means the afterEach cleanup at lines 76–79 (if (runtime) { await runtime.dispose(); }) will never execute, leaking the ManagedRuntime (and its underlying SQLite connections, fibers, etc.) after each test. While this is a pre-existing issue (not introduced by the diff), the addition of the new ProjectionThreadWorkUnitRepository and its projector in the pipeline means each leaked runtime now holds slightly more resources. [ Out of scope ]

- add decoding defaults in `AppSettingsSchema` so older persisted settings load safely - export shared `Schema.Literals` types for `EnvMode` and `TimestampFormat` - add a regression test covering pre-new-key settings hydration

Co-authored-by: codex <codex@users.noreply.github.com>

coderabbitai · 2026-03-18T22:59:18Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 7fbcd96b-8120-44c3-b656-d203767c0278

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch split-stack/subagent-work-units

📝 Coding Plan

Generate coding plan for human review comments

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

macroscopeapp · 2026-03-18T23:04:51Z

apps/server/src/orchestration/Layers/ProjectionSnapshotQuery.ts

+const ProjectionThreadWorkUnitDbRowSchema = ProjectionThreadWorkUnit.mapFields(
+  Struct.assign({
+    providerRefs: Schema.NullOr(
+      Schema.fromJsonString(ProjectionThreadWorkUnit.fields.providerRefs),
+    ),
+  }),
+);


🟡 Medium Layers/ProjectionSnapshotQuery.ts:67

ProjectionThreadWorkUnitDbRowSchema decodes provider_refs_json using Schema.fromJsonString(ProjectionThreadWorkUnit.fields.providerRefs), but providerRefs is defined as Schema.optional(OrchestrationWorkUnitProviderRefs). Passing the optional-wrapped schema to fromJsonString is semantically wrong because JSON cannot represent undefined — the decoder will never produce a valid value for this field. Other fields like attachments use the correct pattern: Schema.NullOr(Schema.fromJsonString(...)) with the underlying type schema. Consider passing OrchestrationWorkUnitProviderRefs directly to fromJsonString.

-const ProjectionThreadWorkUnitDbRowSchema = ProjectionThreadWorkUnit.mapFields( - Struct.assign({ - providerRefs: Schema.NullOr( - Schema.fromJsonString(ProjectionThreadWorkUnit.fields.providerRefs), - ), - }), -);

🚀 Reply "fix it for me" or copy this AI Prompt for your agent:

In file apps/server/src/orchestration/Layers/ProjectionSnapshotQuery.ts around lines 67-73: `ProjectionThreadWorkUnitDbRowSchema` decodes `provider_refs_json` using `Schema.fromJsonString(ProjectionThreadWorkUnit.fields.providerRefs)`, but `providerRefs` is defined as `Schema.optional(OrchestrationWorkUnitProviderRefs)`. Passing the optional-wrapped schema to `fromJsonString` is semantically wrong because JSON cannot represent `undefined` — the decoder will never produce a valid value for this field. Other fields like `attachments` use the correct pattern: `Schema.NullOr(Schema.fromJsonString(...))` with the underlying type schema. Consider passing `OrchestrationWorkUnitProviderRefs` directly to `fromJsonString`. Evidence trail: apps/server/src/orchestration/Layers/ProjectionSnapshotQuery.ts lines 67-72 (REVIEWED_COMMIT) - shows `Schema.fromJsonString(ProjectionThreadWorkUnit.fields.providerRefs)` usage apps/server/src/persistence/Services/ProjectionThreadWorkUnits.ts line 27 (REVIEWED_COMMIT) - defines `providerRefs: Schema.optional(OrchestrationWorkUnitProviderRefs)` apps/server/src/orchestration/Layers/ProjectionSnapshotQuery.ts lines 50, 56, 63, 77 (REVIEWED_COMMIT) - shows other `fromJsonString` usages all using concrete types directly, not optional-wrapped schemas

macroscopeapp · 2026-03-18T23:04:51Z

apps/server/src/orchestration/projector.ts

+function retainThreadWorkUnitsAfterRevert(
+  workUnits: ReadonlyArray<OrchestrationThread["workUnits"][number]>,
+  retainedTurnIds: ReadonlySet<string>,
+): ReadonlyArray<OrchestrationThread["workUnits"][number]> {
+  const retained = workUnits.filter((workUnit) => retainedTurnIds.has(workUnit.turnId));
+  const retainedIds = new Set(retained.map((workUnit) => workUnit.id));
+  return retained.filter(
+    (workUnit) => workUnit.parentWorkUnitId === null || retainedIds.has(workUnit.parentWorkUnitId),
+  );
+}


🟢 Low orchestration/projector.ts:140

retainThreadWorkUnitsAfterRevert drops a parent work unit while keeping its child, leaving an orphaned work unit with an invalid parentWorkUnitId. Given work unit B (parent A) and C (parent B), reverting to retain only C's turn keeps C but removes B, so C's parentWorkUnitId points to a non-existent work unit. The parent filter needs to recursively remove descendants of any dropped work unit.

-function retainThreadWorkUnitsAfterRevert( - workUnits: ReadonlyArray<OrchestrationThread["workUnits"][number]>, - retainedTurnIds: ReadonlySet<string>, -): ReadonlyArray<OrchestrationThread["workUnits"][number]> { - const retained = workUnits.filter((workUnit) => retainedTurnIds.has(workUnit.turnId)); - const retainedIds = new Set(retained.map((workUnit) => workUnit.id)); - return retained.filter( - (workUnit) => workUnit.parentWorkUnitId === null || retainedIds.has(workUnit.parentWorkUnitId), - ); -}

🚀 Reply "fix it for me" or copy this AI Prompt for your agent:

In file apps/server/src/orchestration/projector.ts around lines 140-149: `retainThreadWorkUnitsAfterRevert` drops a parent work unit while keeping its child, leaving an orphaned work unit with an invalid `parentWorkUnitId`. Given work unit B (parent A) and C (parent B), reverting to retain only C's turn keeps C but removes B, so C's `parentWorkUnitId` points to a non-existent work unit. The parent filter needs to recursively remove descendants of any dropped work unit. Evidence trail: apps/server/src/orchestration/projector.ts lines 140-147 (REVIEWED_COMMIT): The `retainThreadWorkUnitsAfterRevert` function computes `retainedIds` from the turn-filtered set, then applies parent filtering. When a work unit is removed by the parent filter, its children still pass because `retainedIds` was computed before the parent filtering pass.

macroscopeapp · 2026-03-18T23:04:51Z

apps/server/src/orchestration/Layers/ProviderRuntimeIngestion.ts


+      if (eventTurnId !== undefined) {
+        yield* ensureRootWorkUnit(thread, event, eventTurnId, now);
+      }


🟢 Low Layers/ProviderRuntimeIngestion.ts:1122

When a turn.completed event arrives, ensureRootWorkUnit (line 1124) dispatches a work unit with state: "running" before the turn.completed handler (lines 1143-1157) dispatches the correct completion state. This produces a transient incorrect state where a completed turn's work unit is momentarily marked as running. Consider removing the ensureRootWorkUnit call for turn.completed events, since the completion handler already handles the state transition.

if (eventTurnId !== undefined) { - yield* ensureRootWorkUnit(thread, event, eventTurnId, now); + if (event.type !== "turn.completed") { + yield* ensureRootWorkUnit(thread, event, eventTurnId, now); + } }

🚀 Reply "fix it for me" or copy this AI Prompt for your agent:

In file apps/server/src/orchestration/Layers/ProviderRuntimeIngestion.ts around lines 1122-1125: When a `turn.completed` event arrives, `ensureRootWorkUnit` (line 1124) dispatches a work unit with `state: "running"` before the `turn.completed` handler (lines 1143-1157) dispatches the correct completion state. This produces a transient incorrect state where a completed turn's work unit is momentarily marked as running. Consider removing the `ensureRootWorkUnit` call for `turn.completed` events, since the completion handler already handles the state transition. Evidence trail: apps/server/src/orchestration/Layers/ProviderRuntimeIngestion.ts lines 1123-1124 (ensureRootWorkUnit called for all events with eventTurnId), lines 978-997 (ensureRootWorkUnit always dispatches state: "running" at line 991), lines 1143-1157 (turn.completed handler dispatches correct state), lines 930-976 (upsertWorkUnit dispatches to orchestrationEngine.dispatch)

macroscopeapp · 2026-03-18T23:04:51Z

apps/server/src/provider/Layers/CodexAdapter.ts

  return typeof value === "number" && Number.isFinite(value) ? value : undefined;
 }

+function toTurnId(value: string | undefined): TurnId | undefined {


🟢 Low Layers/CodexAdapter.ts:112

toTurnId checks value?.trim() for truthiness but then passes the original untrimmed value to TurnId.makeUnsafe. If the input contains leading/trailing whitespace (e.g., " abc "), the check passes but a TurnId containing whitespace is created. Consider using TurnId.makeUnsafe(value.trim()) to match the validation logic.

-function toTurnId(value: string | undefined): TurnId | undefined { - return value?.trim() ? TurnId.makeUnsafe(value) : undefined; -}

🚀 Reply "fix it for me" or copy this AI Prompt for your agent:

In file apps/server/src/provider/Layers/CodexAdapter.ts around line 112: `toTurnId` checks `value?.trim()` for truthiness but then passes the original untrimmed `value` to `TurnId.makeUnsafe`. If the input contains leading/trailing whitespace (e.g., `" abc "`), the check passes but a `TurnId` containing whitespace is created. Consider using `TurnId.makeUnsafe(value.trim())` to match the validation logic. Evidence trail: apps/server/src/provider/Layers/CodexAdapter.ts lines 112-113 (at REVIEWED_COMMIT): `function toTurnId(value: string | undefined): TurnId | undefined { return value?.trim() ? TurnId.makeUnsafe(value) : undefined; }` - validates trimmed value but passes untrimmed value. packages/contracts/src/baseSchemas.ts lines 14-29 (at REVIEWED_COMMIT): TurnId is defined via `makeEntityId` using `TrimmedNonEmptyString` which includes `Schema.Trim`, but `makeUnsafe` bypasses schema transformations.

juliusmarminge and others added 4 commits March 18, 2026 13:26

update adapter

aa66e08

Backfill missing app settings defaults during decode

654ca0d

- add decoding defaults in `AppSettingsSchema` so older persisted settings load safely - export shared `Schema.Literals` types for `EnvMode` and `TimestampFormat` - add a regression test covering pre-new-key settings hydration

maybe fix codex subturns

ef843d4

Add subagent work units across providers

e6d856b

Co-authored-by: codex <codex@users.noreply.github.com>

github-actions bot added size:XXL 1,000+ changed lines (additions + deletions). vouch:trusted PR author is trusted by repo permissions or the VOUCHED list. labels Mar 18, 2026

macroscopeapp bot reviewed Mar 18, 2026

View reviewed changes

juliusmarminge changed the title ~~[codex] Add subagent work units across providers~~ Add subagent work units across providers Mar 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add subagent work units across providers#1199

Add subagent work units across providers#1199
juliusmarminge wants to merge 4 commits intosplit-stack/claude-siblingfrom
split-stack/subagent-work-units

juliusmarminge commented Mar 18, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Mar 18, 2026

Review skipped

Uh oh!

macroscopeapp bot Mar 18, 2026

Uh oh!

macroscopeapp bot Mar 18, 2026

Uh oh!

macroscopeapp bot Mar 18, 2026

Uh oh!

macroscopeapp bot Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

juliusmarminge commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ EARLY VIBED DO NOT USE

Add subagent work units tracking across providers and projection pipeline

🗂️ Filtered Issues

Uh oh!

coderabbitai bot commented Mar 18, 2026

Review skipped

Uh oh!

macroscopeapp bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

macroscopeapp bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

macroscopeapp bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

macroscopeapp bot Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

juliusmarminge commented Mar 18, 2026 •

edited

Loading