feat(autoreview): add Cursor review engine#21
Conversation
Add a Cursor-backed autoreview path with fail-closed workspace and MCP gates, stream/json parsing, and deterministic parser tests. This lets trusted repos use Cursor for closeout review while preserving structured validation and live proof workflows. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>
Limit the new Cursor extraction logic to explicit report candidates, keep assistant-message fallback behind terminal-result precedence, and keep reviewer-all opt-in behavior compatible with existing panels. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>
Prefer terminal stream payloads over assistant drafts, raise on malformed terminal results so Cursor retries can fire, and cover the fallback behavior with a regression test. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>
|
Live proof from the final local Cursor-backed implementation. ValidatorsCommand set:
Current branch reviewed with CursorCommand:
Malicious fixture, Cursor finds real issuesCommand:
Benign fixture, Cursor stays cleanCommand:
|
|
Codex review: found issues before merge. Reviewed June 18, 2026, 9:55 AM ET / 13:55 UTC. Summary Reproducibility: not applicable. This PR adds a new optional review engine rather than fixing a reproducible bug. The contributor did provide live terminal output showing Cursor-backed branch review and fixture behavior after the implementation. Review metrics: 2 noteworthy metrics.
Merge readiness Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch. Rank-up moves:
Risk before merge
Maintainer options:
Next step before merge
Security Review findings
Review detailsBest possible solution: Rebase Cursor support onto current main, run cursor-agent through the same sanitized engine environment as the other external reviewers, preserve the explicit fail-closed trust gates, and then let maintainers decide whether to accept Cursor as an optional engine. Do we have a high-confidence way to reproduce the issue? Not applicable; this PR adds a new optional review engine rather than fixing a reproducible bug. The contributor did provide live terminal output showing Cursor-backed branch review and fixture behavior after the implementation. Is this the best way to solve the issue? No, not as submitted. The direction is plausible, but the branch must be rebased onto current main and the Cursor execution path should adopt the current sanitized engine environment before maintainers decide on the feature. Full review comments:
Overall correctness: patch is incorrect AGENTS.md: found and applied where relevant. Codex review notes: model internal, reasoning high; reviewed against 844dfb22a969. Label changesLabel changes:
Label justifications:
Evidence reviewedSecurity concerns:
What I checked:
Likely related people:
What the crustacean ranks mean
Shiny media proof means a screenshot, video, or linked artifact directly shows the changed behavior. Runtime, network, CSP, and security claims still need visible diagnostics. How this review workflow works
|
Summary
cursor-agentask-mode executionValidation
scripts/validate-skillsruby -c scripts/install-skillsruby -c scripts/validate-skillsruby - <<'RUBY' ... load \"scripts/install-skills.test.rb\" ... RUBY(local Ruby 2.6 compatibility shim forassert_path_exists; CI uses Ruby 3.3)bash -n skills/autoreview/scripts/test-review-harnesspython3 -m py_compile skills/autoreview/scripts/autoreview skills/autoreview/scripts/test-review-harness.py skills/autoreview/scripts/autoreview_test.pypython3 -m unittest skills/autoreview/scripts/autoreview_test.pynode --check skills/agent-transcript/scripts/agent-transcriptnode --test skills/agent-transcript/scripts/agent-transcript.test.mjs skills/session-viewer/scripts/session-viewer.test.tspython3 skills/autoreview/scripts/autoreview --mode branch --base origin/main --engine cursor --model auto --cursor-allow-workspace-instructions --stream-engine-outputI will add a follow-up PR comment with live proof logs from the local Cursor runs.