Skip to content

Add launch + monitor commands to manage-experiment#31

Merged
elliotrfeinberg merged 3 commits into
manage-experiment-umbrellafrom
manage-experiment-launch-monitor
Jun 10, 2026
Merged

Add launch + monitor commands to manage-experiment#31
elliotrfeinberg merged 3 commits into
manage-experiment-umbrellafrom
manage-experiment-launch-monitor

Conversation

@elliotrfeinberg

Copy link
Copy Markdown
Contributor

Stacks on top of #30. Extends the manage-experiment umbrella from 2 commands (design, interpret) to 4 commands covering the full experiment lifecycle.

Why stack instead of fold into #30

#30's review focuses on the umbrella pattern. This PR adds 240+ lines of new prose (launch readiness, mid-flight safety, peeking discipline) — folding it in would force reviewers to evaluate both the structural pattern and the new domain content at the same time. Stacking keeps the review boundaries clean.

New commands

`commands/launch.md` (~125 lines)

Owns the irreversible DRAFT → ACTIVE transition. Runs the final launch-readiness check (lifted from design step 7 — that step now catches design-time issues only) and performs the launch with explicit `CONFIRM`. Recommends a tracking dashboard + a calendar reminder for monitor at the end.

`commands/monitor.md` (~115 lines)

Mid-flight safety. Answers "is it safe to keep this running?" — distinct from interpret's "did it work?" Covers:

  • Peek-safety table: what's safe to look at mid-flight (SRM, sample pace, guardrails, Sequential primaries) and what isn't (Frequentist primaries — the peeking trap).
  • Terminate-early decision rules: trustworthiness failure, guardrail regression beyond tolerance, Sequential boundary crossed. Explicit non-justifications ("the lift looks good", "the team is tired of waiting").
  • Don't-peek pushback for users who ask to see primary results mid-flight on a Frequentist test.

Knock-on changes

  • Umbrella SKILL.md: commands table now has 4 rows, menu has 5 options. Phase-derived routing extended to DRAFT-design vs DRAFT-launch and ACTIVE-monitor vs ACTIVE-interpret. Reference table "Used by" columns updated.
  • `commands/design.md` step 7: reframed as "design-time sanity check" rather than "the pre-launch check". The full launch-readiness gate now lives in launch.md. Step 8 saves a DRAFT (reversible) rather than creating-and-launching. Added step 9: hand off to launch.
  • `design.md` opening line: updated from "don't create until confirmed" → "stops at DRAFT, launch is a separate command".

Sequencing

Depends on #30. PR base is `manage-experiment-umbrella` (will rebase to master once #30 merges).

Test plan

  • `make sync-skills` clean (mixpanel-mcp / -eu / -in identical)
  • `/review-skill manage-experiment` ≥ 90%
  • Trigger surface: confirm phrases like "launch this experiment", "is it safe to keep this running", "should I peek" route to this skill
  • Phase-derived routing for DRAFT: ask design-vs-launch disambiguation if not obvious
  • Phase-derived routing for ACTIVE: route to monitor if mid-flight, interpret if reached planned end
  • Launch command refuses to launch past a blocker
  • Monitor command pushes back on Frequentist primary-metric peek requests
  • All intra-skill links resolve

🤖 Generated with Claude Code

@elliotrfeinberg elliotrfeinberg marked this pull request as ready for review June 10, 2026 01:00
@elliotrfeinberg elliotrfeinberg requested a review from gslopez June 10, 2026 01:07
elliotrfeinberg and others added 3 commits June 10, 2026 01:10
Extends the manage-experiment umbrella (PR #30) from 2 commands
(design, interpret) to 4 commands covering the full experiment
lifecycle. Stacks on top of PR #30 so the umbrella structure and
this extension can be reviewed separately.

## New commands

### commands/launch.md (~125 lines)

Owns the irreversible DRAFT → ACTIVE transition. Runs the final
launch-readiness check (lifted from design step 7 — that step now
catches design-time issues only) and performs the launch with
explicit CONFIRM. Includes the post-launch handoff: recommend a
tracking dashboard + a calendar reminder for monitor.

### commands/monitor.md (~115 lines)

Mid-flight safety checks. Answers "is it safe to keep this running?"
which is distinct from interpret's "did it work?" Covers:
- The peek-safety table: what's safe to look at mid-flight (SRM,
  sample pace, guardrails, Sequential primaries) and what isn't
  (Frequentist primaries — the peeking trap).
- The terminate-early decision rules: trustworthiness failure,
  guardrail regression beyond tolerance, Sequential boundary crossed.
- An explicit don't-peek-on-Frequentist pushback for users who ask
  to see primary results mid-flight.

## Knock-on changes

- umbrella SKILL.md: 4-row commands table, 5-option menu, phase-derived
  routing extended to DRAFT-design vs DRAFT-launch and ACTIVE-monitor
  vs ACTIVE-interpret. Reference table "Used by" columns updated
  (pitfalls → design+launch, health-check → monitor+interpret,
  sizing → design+monitor+interpret).
- commands/design.md step 7: reframed as "design-time sanity check"
  rather than "the pre-launch check". The full launch-readiness gate
  now lives in launch.md. Step 8 saves a DRAFT (reversible) rather
  than creating-and-launching. Added step 9: hand off to launch.
- design.md opening line: updated to say "stops at DRAFT" instead
  of "don't create until confirmed".

Synced to mixpanel-mcp-eu and mixpanel-mcp-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
… emoji, hedge cross-skill refs

Applies the 5 issues surfaced by the /review-skill pass on the 4-command
manage-experiment:

## Major (D4)

The 5% guardrail hard-gate and the peek-safety table were duplicated
across design / launch / monitor — three places to keep load-bearing
thresholds in sync.

- Add a "Cross-command policies" section to the umbrella SKILL.md that
  owns the guardrail hard-gate threshold and the peek-safety table.
  Also moves the emoji convention there so all commands share one
  vocabulary.
- design.md "The >5% guardrail hard-gate" → "Guardrails (the hard-gate
  enforced downstream)" — keeps the design-time guidance, defers the
  threshold to the umbrella.
- design.md step 5: "peeking is safe by design" → "peek-safety table
  in the umbrella's Cross-command policies covers why".
- launch.md readiness checklist: ">5% hard-gate from the design command"
  → "regression hard-gate (see umbrella Cross-command policies)".
- monitor.md Components: drop the entire "What's safe to peek at"
  table, reference the umbrella. Glossary defers "peeking trap" to
  the umbrella. Terminate-early rules cite the umbrella threshold.

## Minor (D3)

design.md output-style section had a duplicate hand-off line
restating what step 9 already covers — removed.

## Minor (D7)

launch.md step 5 dashboard recommendation was unqualified — added
"recommend running create-dashboard in a follow-up session" so the
agent doesn't try to invoke it inline and interrupt the launch flow.

## Suggestion (D3)

launch.md irreversibility section mentioned "pause/resume via the
underlying feature flag" without naming the skill — added explicit
"handled by the manage-feature-flags skill" cross-ref.

## Emoji convention (D6)

Declared once in the umbrella's Cross-command policies. Existing
emoji usage across design/launch/monitor already matches; interpret
has no symbolic checklist output (its output is prose verdict + per-
metric breakdown), which is correct.

## Knock-on numbers

Skill core: 692 → 720 lines (umbrella +38 with the new policies
section, monitor -10 from removed table, others flat).

Synced to mixpanel-mcp-eu and mixpanel-mcp-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
… example

/review-document feedback on PR #31:

- D6: monitor.md deferred the "peeking trap" definition to the umbrella's
  Cross-command policies, but the umbrella's peek-safety section only
  showed consequences and never defined the term. Added a one-line
  definition before the table.
- D8: launch.md step 5 calendar-reminder recommendation was abstract.
  Added concrete phrasing the agent can hand to the user so it doesn't
  improvise the wording inconsistently across sessions.

Synced to eu/in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@elliotrfeinberg elliotrfeinberg force-pushed the manage-experiment-launch-monitor branch from e481cda to 16b62b6 Compare June 10, 2026 01:11
@elliotrfeinberg elliotrfeinberg merged commit 1bc3dcd into manage-experiment-umbrella Jun 10, 2026
1 check passed
@elliotrfeinberg elliotrfeinberg deleted the manage-experiment-launch-monitor branch June 10, 2026 18:12
elliotrfeinberg added a commit that referenced this pull request Jun 17, 2026
* New skill: manage-experiment (umbrella over design + interpret)

Combines the soon-to-land `design-experiment` (PR #24) and
`interpret-experiment` (PR #23) skills into a single `manage-experiment`
umbrella, modeled on `manage-lexicon`. One front door for any
experiment-related phrase; subcommands for the two phases.

## Why

Today an agent has to disambiguate between two skills on every
experiment turn, and borderline phrases ("audit my experiment",
"check on experiment X") route inconsistently. Folding both into one
umbrella with `design` and `interpret` subcommands:

- Gives a single trigger surface for the agent to route to.
- Lets ambiguous phrases use phase-derived disambiguation (DRAFT →
  design; ACTIVE/CONCLUDED → interpret).
- Removes the cross-skill reference coupling that currently requires
  coordinated edits across two PRs whenever either skill renames.
- Reserves room to add `launch` and `monitor` commands later without
  another rename.

## Shape

```
manage-experiment/
├── SKILL.md              (umbrella: shared glossary, command menu, project + experiment resolution)
├── commands/
│   ├── design.md         (former design-experiment SKILL.md body)
│   └── interpret.md      (former interpret-experiment SKILL.md body)
└── references/           (15 references, flat; no name collisions)
```

The two command files preserve their original prose verbatim except for:
- Removing the now-redundant frontmatter and opening "You are helping…"
  paragraph.
- Lifting umbrella-able glossary terms (Variant, Primary/Guardrail/
  Secondary, Direction, Lift, MDE, CUPED, Winsorization, Multiple-
  testing correction) up to the umbrella; keeping phase-specific terms
  (Hypothesis, Power, Underpowered, Sequential vs Frequentist for
  design; Polarity, Significance, SRM, Retro A/A, Twyman's Law,
  Trustworthiness gate for interpret) in their command files.
- Adjusting reference paths to `../references/`.

The 15 reference files are unchanged except for two intra-skill links
in `why-no-statsig.md` that previously pointed at the `design-experiment`
skill — now point directly at `sizing.md` (where the formulas live).

The 3 references to `manage-feature-flags` (in SKILL.md, design.md, and
routing-xp-vs-ff.md) are correct cross-skill pointers and unchanged.

## Sequencing

This PR is opened **before** PR #23 and PR #24 merge. Once those land,
this PR will need a rebase against master that deletes the now-existing
`design-experiment/` and `interpret-experiment/` directories. Per the
plan, doing the umbrella as a separate PR keeps the review focused on
the routing pattern rather than tangled into either skill's prose review.

The `manage-feature-flags` skill (PR #25) cross-references both old
skill names; once both PRs land, those cross-refs will need a sweep
to point at `manage-experiment`.

README index updated: one `manage-experiment` row replaces the two
prospective `design-experiment` and `interpret-experiment` rows.

Synced to mixpanel-mcp-eu and mixpanel-mcp-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Address review minors: add reference index, clarify routing fall-through, dedupe match logic

Applies the 3 issues surfaced by the /review-skill pass on manage-experiment:

- D4 (Centralised logic): the umbrella SKILL.md had no map of what
  references/ contains. Add a "Reference files" table to SKILL.md
  listing each file, which command uses it, and its purpose. Mirrors
  the shape used by manage-feature-flags.
- D7 (Qualified recommendations): the phase-derived rule in step 3
  ("DRAFT → design, ACTIVE/CONCLUDED → interpret") read as if it
  always applied, but it only does when an experiment is in context.
  Restate step 3 as ordered rules with explicit fall-through, and
  add a row for ambiguous verbs like "audit" / "check" / "review".
- D6 (Human-friendly identifiers): the "try ID first, then
  case-insensitive name match" rule was stated in three places (umbrella
  step 2, design step 8, interpret step 1). Drop the duplicates from
  the command files — interpret hands back to the umbrella, design
  retains only the "ask if not named" guidance (the lookup mechanics
  don't apply to free-text feature names).

Synced to mixpanel-mcp-eu and mixpanel-mcp-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Fix Winsorization percentile convention: agent passes tail-width (5), not cap-point (95)

PR #30 review caught an agent-actionable accuracy bug. The skill docs
described Winsorization as "default 95 (cap top/bottom 5%)" — natural
phrasing for a human, but the actual `Update-Experiment` tool schema
defines `WinsorizationSettingsInput.percentile` as the **tail width to
cap on each side**, with default `5` and `exclusiveMaximum: 50`. An
agent reading the docs would pass `95`, get rejected by validation, or
misinterpret as inverted Winsorization.

Updated 6 files to match the schema's convention:

- SKILL.md shared glossary
- commands/design.md step 6 (advanced features)
- references/advanced-features.md "What it does", "How to enable",
  "Percentile guidance" (default flipped from "95th" to `percentile=5`,
  "push back below ~80" flipped to "push back above ~20")
- references/health-check-interpretation.md "Extreme winsorization
  percentile" misconfiguration check
- references/per-metric-interpretation.md "Variance-reduction & outlier
  settings that change interpretation"
- references/pitfalls.md "High variance, no Winsorization" warning

The numeric guidance ("push back if more than 20% of each tail is
capped") is preserved; only the convention name flipped to match what
the agent actually sends through the tool.

Synced to mixpanel-mcp-eu and mixpanel-mcp-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Add launch + monitor commands to manage-experiment (#31)

* Add launch and monitor commands to manage-experiment

Extends the manage-experiment umbrella (PR #30) from 2 commands
(design, interpret) to 4 commands covering the full experiment
lifecycle. Stacks on top of PR #30 so the umbrella structure and
this extension can be reviewed separately.

## New commands

### commands/launch.md (~125 lines)

Owns the irreversible DRAFT → ACTIVE transition. Runs the final
launch-readiness check (lifted from design step 7 — that step now
catches design-time issues only) and performs the launch with
explicit CONFIRM. Includes the post-launch handoff: recommend a
tracking dashboard + a calendar reminder for monitor.

### commands/monitor.md (~115 lines)

Mid-flight safety checks. Answers "is it safe to keep this running?"
which is distinct from interpret's "did it work?" Covers:
- The peek-safety table: what's safe to look at mid-flight (SRM,
  sample pace, guardrails, Sequential primaries) and what isn't
  (Frequentist primaries — the peeking trap).
- The terminate-early decision rules: trustworthiness failure,
  guardrail regression beyond tolerance, Sequential boundary crossed.
- An explicit don't-peek-on-Frequentist pushback for users who ask
  to see primary results mid-flight.

## Knock-on changes

- umbrella SKILL.md: 4-row commands table, 5-option menu, phase-derived
  routing extended to DRAFT-design vs DRAFT-launch and ACTIVE-monitor
  vs ACTIVE-interpret. Reference table "Used by" columns updated
  (pitfalls → design+launch, health-check → monitor+interpret,
  sizing → design+monitor+interpret).
- commands/design.md step 7: reframed as "design-time sanity check"
  rather than "the pre-launch check". The full launch-readiness gate
  now lives in launch.md. Step 8 saves a DRAFT (reversible) rather
  than creating-and-launching. Added step 9: hand off to launch.
- design.md opening line: updated to say "stops at DRAFT" instead
  of "don't create until confirmed".

Synced to mixpanel-mcp-eu and mixpanel-mcp-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Address review issues: lift cross-command policies to umbrella, unify emoji, hedge cross-skill refs

Applies the 5 issues surfaced by the /review-skill pass on the 4-command
manage-experiment:

## Major (D4)

The 5% guardrail hard-gate and the peek-safety table were duplicated
across design / launch / monitor — three places to keep load-bearing
thresholds in sync.

- Add a "Cross-command policies" section to the umbrella SKILL.md that
  owns the guardrail hard-gate threshold and the peek-safety table.
  Also moves the emoji convention there so all commands share one
  vocabulary.
- design.md "The >5% guardrail hard-gate" → "Guardrails (the hard-gate
  enforced downstream)" — keeps the design-time guidance, defers the
  threshold to the umbrella.
- design.md step 5: "peeking is safe by design" → "peek-safety table
  in the umbrella's Cross-command policies covers why".
- launch.md readiness checklist: ">5% hard-gate from the design command"
  → "regression hard-gate (see umbrella Cross-command policies)".
- monitor.md Components: drop the entire "What's safe to peek at"
  table, reference the umbrella. Glossary defers "peeking trap" to
  the umbrella. Terminate-early rules cite the umbrella threshold.

## Minor (D3)

design.md output-style section had a duplicate hand-off line
restating what step 9 already covers — removed.

## Minor (D7)

launch.md step 5 dashboard recommendation was unqualified — added
"recommend running create-dashboard in a follow-up session" so the
agent doesn't try to invoke it inline and interrupt the launch flow.

## Suggestion (D3)

launch.md irreversibility section mentioned "pause/resume via the
underlying feature flag" without naming the skill — added explicit
"handled by the manage-feature-flags skill" cross-ref.

## Emoji convention (D6)

Declared once in the umbrella's Cross-command policies. Existing
emoji usage across design/launch/monitor already matches; interpret
has no symbolic checklist output (its output is prose verdict + per-
metric breakdown), which is correct.

## Knock-on numbers

Skill core: 692 → 720 lines (umbrella +38 with the new policies
section, monitor -10 from removed table, others flat).

Synced to mixpanel-mcp-eu and mixpanel-mcp-in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* PR #31 polish: define peeking trap in umbrella, add calendar-reminder example

/review-document feedback on PR #31:

- D6: monitor.md deferred the "peeking trap" definition to the umbrella's
  Cross-command policies, but the umbrella's peek-safety section only
  showed consequences and never defined the term. Added a one-line
  definition before the table.
- D8: launch.md step 5 calendar-reminder recommendation was abstract.
  Added concrete phrasing the agent can hand to the user so it doesn't
  improvise the wording inconsistently across sessions.

Synced to eu/in.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Elliot Feinberg <5232369+elliotrfeinberg@users.noreply.github.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* manage-experiment: port statsig diagnosis depth + align to skill rubric

Port the three pieces from the analytics monorepo's statsig-diagnosis
guidance (tal-multi-546) that PR #30's why-no-statsig.md lacked, then
reconcile them with this repo's skill quality rubric.

Port:
- sizing.md: relative, unit-safe achievable-MDE formulas (MDE_relative,
  n_required) with the sigma=sqrt(sigma^2) caution, framed as explaining
  the NO verdict rather than overriding it.
- why-no-statsig.md: a "pull the diagnostic inputs first" step and a
  priority-ordered "name the single most-likely blocker" table so the
  answer is one blocker with numbers, not a generic option list.

Rubric alignment (review-skill: 60% capped -> ~98%):
- Reframe the diagnostic-inputs step to intent (no named tool call or
  response field paths) so it stays engine-agnostic, lifting the cap.
- Fix advanced-features.md winsorization percentile contradiction
  (tail-width 5 convention vs stale 95/90 boundary references).
- Fix dead/stale cross-references in per-metric-interpretation.md and
  routing-xp-vs-ff.md to point at sections/commands that exist.
- Add a Contents block to every 100+ line reference.
- Collapse the state->command mapping duplicated 3x in SKILL.md (now 200).

Synced byte-identical across mixpanel-mcp / -eu / -in.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

* manage-experiment: align prior-experiments lookup with overlap-ranking tool

The Search-Prior-Experiments engine tool (analytics #95682) ranks stored
experiments by overlap on the draft's metric set, flag key, and hypothesis
rather than free-text keyword matching. Update the design command and the
prior-experiments reference so they steer the agent to hand the tool the
draft it's building instead of constructing single-keyword queries.
Engine-agnostic wording — describes the signals, not tool names or field
paths.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

* manage-experiment: trim description under the 1024-char spec limit

The description was 1326 chars (over the Agent Skills 1024 limit) because it
enumerated ~15 verbatim trigger phrasings. Trim to a representative handful
and fold Bonferroni/Benjamini-Hochberg into "multiple-testing correction";
the Canonical commands table in the body already carries the full keyword
set. Now 921 chars. Synced across all three plugin variants.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

* manage-experiment: address skill-review findings across SKILL + references

Fixes from the review-skill rubric pass and the per-reference review:

- DRY/portability: de-link all reference->reference chains so references
  stay one level deep; strip tool-docs leakage (significance enum,
  summary.* buckets, liftConfidence, decide constants) to intent-level.
- Centralise the Winsorization >~20% push-back rule in the SKILL.md
  glossary; centralise the polarity recipe in the interpret command;
  both now referenced rather than restated.
- Source defaults: qualify the Benjamini-Hochberg platform default,
  CUPED 30-70% range, and the `up` direction default with "verify".
- Accuracy fixes in references: correct the family-wise FPR claim
  (3-arm -> 4-arm), drop in-file Winsorization duplication, fix the
  "constants"->"choices" dangling referent in lifecycle-handoff,
  soften absolutist hypothesis-framing phrasing, de-changelog the
  segment-breakdown platform-status note.

Applied identically across the mixpanel-mcp, -eu, and -in variants.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

* manage-experiment: address CRUD QA findings

Live CRUD QA against the Mixpanel MCP surfaced gaps where the skill
instructs behavior the tools handle differently than assumed:

- Settings full-replace footgun (High): the experiment-update tool's
  `settings` payload is a full replace, so a one-field edit silently
  nulls `srm`, `excludeQA`, etc. Added a read-merge-write Behaviour
  rule to SKILL.md, a pointer in design step 8, and a launch-readiness
  warning row for dropped `srm` / `excludeQA`.
- Metric `direction` not settable via inline metric creation (Medium):
  metrics default to `up`. Added a caveat in design step 3 to set
  `down` in the UI for down-polarity metrics and verify before launch.
- Archive doesn't cascade to the backing flag (Low): noted in
  lifecycle-handoff that the auto-created flag is left behind.
- Generic experiment-listing errors (Low): Behaviour rule 2 now covers
  the experiments-not-enabled failure case.

Synced across mixpanel-mcp / -eu / -in.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* manage-experiment: address review feedback (routing dedupe + glossary concision)

Resolves @zoeabrams-eng's two inline review threads on SKILL.md:

- Routing duplication: "Pick the command" (step 3) no longer restates
  the explicit/implicit/phase-derived rules already in "Canonical
  commands" — it now references the table and mapping and applies them.
- Glossary verbosity: trimmed the Winsorization entry to a one-liner
  plus a pointer to advanced-features.md (which already holds the full
  push-back rule), and tightened the Direction / CUPED / Lift /
  multiple-testing entries.

Repointed the two remaining "umbrella glossary's push-back rule"
references (design step 6, pitfalls.md) so the rule's canonical home is
advanced-features.md; kept pitfalls' mention inline to avoid a
reference->reference link chain.

Synced across mixpanel-mcp / -eu / -in.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* manage-experiment: note metric direction (polarity) is editable post-setup

Metric direction/polarity is a property of the saved metric and can now be
corrected after setup via the metric-update tool, without recreating the
metric or experiment. Update the skill to reflect this:

- design: setting `down` no longer requires the Mixpanel UI — the
  metric-update tool works on the draft
- glossary, metric-selection, launch readiness, interpret: note the
  in-place fix; full statement centralized in the Direction glossary entry,
  others trimmed to brief pointers
- use the skill's descriptive "metric-update tool" voice instead of the
  literal Update-Metric tool ID

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Elliot Feinberg <5232369+elliotrfeinberg@users.noreply.github.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant