Enforce VM size minimum during node selection by simple-agent-manager[bot] · Pull Request #875 · raphaeltm/simple-agent-manager

simple-agent-manager · 2026-05-01T19:20:34Z

Summary

Treat requested VM size as a minimum capacity during node reuse.
Allow larger nodes to satisfy smaller requests, while preventing smaller nodes from satisfying larger requests.
Add shared VM size satisfaction helper, behavioral selector/TaskRunner regression tests, task tracking, post-mortem, and a process rule update.

Validation

pnpm lint
pnpm typecheck
pnpm test
Additional validation run (if applicable)

Additional validation:

pnpm --filter @simple-agent-manager/shared test -- vm-sizes
pnpm --filter @simple-agent-manager/api test -- durable-objects/task-runner-node-selection services/node-selector
pnpm --filter @simple-agent-manager/shared typecheck
pnpm --filter @simple-agent-manager/api typecheck
git diff --check

Staging Verification (REQUIRED for all code changes — merge-blocking)

Staging deployment green — Deploy Staging workflow triggered manually and passed for this branch
Live app verified via Playwright — staging workflow smoke-tests logged into app.sammy.party and passed
Existing workflows confirmed working — staging workflow smoke-tests passed against the live app
New feature/fix verified on staging — deployment/smoke path verified on staging; VM size minimum behavior verified by behavioral regression tests rather than live VM provisioning
Infrastructure verification completed — N/A: no infrastructure paths changed
Mobile and desktop verification notes added for UI changes — N/A: no runtime UI changes

Staging Verification Evidence

Deploy Staging workflow passed: https://github.com/raphaeltm/simple-agent-manager/actions/runs/25232215432
Deploy to Cloudflare job passed.
Staging smoke-tests job passed.
The staging workflow deployed code commit 2f89ce1a; later commits 148fdd36, af7412f1, and 95097497 only refreshed PR/task evidence and added a backlog deferral for audit persistence.
Local one-off browser smoke reached token-login successfully, then stopped because this workspace does not have the Playwright Chromium binary installed. The GitHub staging smoke-tests job is the browser-backed live staging evidence.
Full live VM provisioning was not run to avoid staging node quota/cost; the VM-size behavior is covered by behavioral tests for helper semantics, standalone selector warm/existing reuse, and TaskRunner preferred/warm/existing reuse.

UI Compliance Checklist (Required for UI changes)

N/A: no runtime UI changes. packages/ui/tests/ButtonGroup.test.tsx was updated only to tolerate equivalent CSS zero-value serialization (0 / 0px).

End-to-End Verification (Required for multi-component changes)

Data flow traced from user input to final outcome with code path citations (see .claude/rules/10-e2e-verification.md)
Capability test exercises the complete happy path across system boundaries
All spec/doc assumptions about existing behavior verified against code (not just "read the code")
If any gap exists between automated test coverage and full E2E, manual verification steps documented below

Data Flow Trace

Project/task VM size resolves before TaskRunner start:
apps/api/src/routes/tasks/submit.ts resolves explicit task value, then agent profile value, then project.defaultVmSize, then platform default.
TaskRunner receives the resolved minimum size:
apps/api/src/services/task-runner-do.ts:startTaskRunnerDO() forwards TaskRunConfig.vmSize.
Preferred-node reuse rejects undersized nodes:
apps/api/src/durable-objects/task-runner/node-steps.ts:handleNodeSelection() reads nodes.vm_size and calls canSatisfyVmSize(node.vm_size, state.config.vmSize).
Warm-node reuse rejects undersized nodes:
apps/api/src/durable-objects/task-runner/node-steps.ts:tryClaimWarmNode() filters warm candidates with canSatisfyVmSize.
Existing-node reuse rejects undersized nodes:
apps/api/src/durable-objects/task-runner/node-steps.ts:findNodeWithCapacity() filters running candidates with canSatisfyVmSize.
Standalone selector follows the same rule:
apps/api/src/services/node-selector.ts:selectNodeForTaskRun() filters warm and existing candidates with canSatisfyVmSize.

Untested Gaps

Full live VM provisioning was not run to avoid staging node quota/cost. Automated coverage exercises helper semantics, standalone selector warm/existing paths, and TaskRunner preferred/warm/existing paths with mixed VM sizes.

Post-Mortem (Required for bug fix PRs)

What broke

Node reuse treated requested VM size as a soft preference. A larger request could reuse a smaller node when no exact-size node was selected first.

Root cause

Standalone and TaskRunner node selection sorted by exact VM-size match but did not reject undersized candidates before reuse.

Class of bug

Selection logic treated a hard compatibility constraint as a ranking preference.

Why it wasn't caught

Existing tests covered helper-like behavior and source structure, but did not execute mixed-size node selection paths with incompatible candidates.

Process fix included in this PR

.claude/rules/10-e2e-verification.md now requires behavioral tests for compatibility constraints in selection logic, including incompatible candidates that would otherwise rank well.

Post-mortem file

docs/notes/2026-05-01-vm-size-minimum-selection-postmortem.md

Specialist Review Evidence (Required for agent-authored PRs)

All dispatched reviewers completed and findings addressed before merge
If any reviewer did NOT complete: needs-human-review label added and merge deferred to human — resolved after successful retry

Reviewer	Status	Outcome
cloudflare-specialist	PASS	No blocking findings; D1/DO usage remains parameterized and no migration/binding changes required.
constitution-validator	ADDRESSED	Initial duplicate `VM_SIZE_RANK` finding fixed by deriving ordering from `DEFAULT_VM_SIZE_VCPUS`; re-check passed.
test-engineer	ADDRESSED	Initial behavioral coverage findings fixed with standalone selector and TaskRunner tests; re-check passed.
task-completion-validator	PASS	Retry found one uncovered research finding about audit persistence; fixed by adding `tasks/backlog/2026-05-01-persist-task-requested-vm-size.md` and linking the explicit deferral from the active task. Final re-check passed with no blocking findings.

Exceptions (If any)

Scope: UI visual audit
Rationale: only UI test assertion changed; no runtime UI component/source change.
Expiration: This PR only.

Agent Preflight (Required)

Preflight completed before code changes

Classification

External References

N/A: no external API behavior changed. Investigation used repo code and staging D1 state.

Codebase Impact Analysis

Affected paths:

packages/shared/src/constants/vm-sizes.ts
apps/api/src/services/node-selector.ts
apps/api/src/durable-objects/task-runner/node-steps.ts
API/shared tests for VM size minimum semantics
Task tracking and bug-fix process docs

Documentation & Specs

Updated:

tasks/active/2026-05-01-vm-size-minimum-selection.md
docs/notes/2026-05-01-vm-size-minimum-selection-postmortem.md
.claude/rules/10-e2e-verification.md

Constitution & Risk Check

Checked Principle XI. Initial review found a duplicate hardcoded VM-size rank table; fixed by deriving ordering from the existing DEFAULT_VM_SIZE_VCPUS source of truth.

sonarqubecloud · 2026-05-01T21:39:44Z

Quality Gate passed

Issues
3 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
1.9% Duplication on New Code

See analysis details on SonarQube Cloud

raphaeltm added 6 commits May 1, 2026 19:20

Enforce VM size minimum during node selection

d27d829

task: track VM size minimum selection

7b1583d

test: tolerate zero radius serialization

9056462

test: cover VM size selection behavior

af22081

docs: record VM size selection postmortem

e6d59e8

task: update VM size validation evidence

2f89ce1

simple-agent-manager Bot added the needs-human-review Agent could not complete all review gates — human must approve before merge label May 1, 2026

simple-agent-manager Bot temporarily deployed to staging May 1, 2026 20:40 Inactive

raphaeltm added 3 commits May 1, 2026 20:44

ci: refresh PR evidence

148fdd3

task: record staging verification evidence

af7412f

task: defer VM size audit persistence

9509749

simple-agent-manager Bot removed the needs-human-review Agent could not complete all review gates — human must approve before merge label May 1, 2026

ci: refresh specialist review evidence

9aaef38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enforce VM size minimum during node selection#875

Enforce VM size minimum during node selection#875
simple-agent-manager[bot] wants to merge 10 commits intomainfrom
sam/possible-opinion-bug-terms-01kqje

simple-agent-manager Bot commented May 1, 2026 •

edited

Loading

Uh oh!

sonarqubecloud Bot commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

simple-agent-manager Bot commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Validation

Staging Verification (REQUIRED for all code changes — merge-blocking)

Staging Verification Evidence

UI Compliance Checklist (Required for UI changes)

End-to-End Verification (Required for multi-component changes)

Data Flow Trace

Untested Gaps

Post-Mortem (Required for bug fix PRs)

What broke

Root cause

Class of bug

Why it wasn't caught

Process fix included in this PR

Post-mortem file

Specialist Review Evidence (Required for agent-authored PRs)

Exceptions (If any)

Agent Preflight (Required)

Classification

External References

Codebase Impact Analysis

Documentation & Specs

Constitution & Risk Check

Uh oh!

sonarqubecloud Bot commented May 1, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

simple-agent-manager Bot commented May 1, 2026 •

edited

Loading