You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix: update all stale Outside Voice references to include Gemini fallback chain
Updated Outside Voice description in Review Readiness Dashboard sections
across plan-eng-review, plan-ceo-review, ship, plan-design-review, and
design-review SKILL.md files to reflect the new fallback chain:
Codex → Gemini (gemini-3.1-pro-preview) → Claude subagent.
@@ -910,16 +910,119 @@ Record baseline design score and AI slop score at end of Phase 6.
910
910
911
911
## Design Outside Voices (parallel)
912
912
913
-
**Automatic:** Outside voices run automatically when Codex is available. No opt-in needed.
913
+
**Automatic:** Outside voices run automatically when Codex or Gemini is available. No opt-in needed.
914
914
915
-
**Check Codex availability:**
915
+
**Check tool availability:**
916
916
```bash
917
917
which codex 2>/dev/null &&echo"CODEX_AVAILABLE"||echo"CODEX_NOT_AVAILABLE"
918
+
which gemini 2>/dev/null &&echo"GEMINI_AVAILABLE"||echo"GEMINI_NOT_AVAILABLE"
918
919
```
919
920
920
-
**If Codex is available**, launch both voices simultaneously:
921
+
Launch all available voices simultaneously:
921
922
922
-
1.**Codex design voice** (via Bash):
923
+
1.**Gemini visual voice** (PRIORITY — via Bash, if GEMINI_AVAILABLE):
924
+
925
+
Gemini 3.1 Pro has the strongest multimodal vision of any model. Use it as the **primary visual reviewer** — feed actual screenshots from Phase 1 and Phase 3 directly to Gemini for pixel-level design analysis. This catches issues that source-code-only reviewers miss: visual hierarchy balance, whitespace rhythm, color harmony, compositional weight, and AI slop patterns.
926
+
927
+
**Collect screenshots from previous phases:**
928
+
```bash
929
+
SCREENSHOTS=$(find "$REPORT_DIR/screenshots" -name "*.png" -type f 2>/dev/null | head -5)
"You are a world-class product designer doing a VISUAL audit of these screenshots. You are looking at the RENDERED output, not source code. Evaluate what you SEE:
950
+
951
+
VISUAL HIERARCHY & COMPOSITION:
952
+
- Where does the eye land first? Is that intentional?
953
+
- Is there clear primary > secondary > tertiary hierarchy?
954
+
- White space rhythm: intentional or accidental gaps?
955
+
- Squint test: does the layout hold when blurred?
956
+
- Compositional weight: balanced or lopsided?
957
+
958
+
COLOR & HARMONY:
959
+
- Is the palette cohesive across all screenshots?
960
+
- Contrast ratios: any text barely readable?
961
+
- Semantic color usage: red=error, green=success consistent?
962
+
- Dark mode: surfaces use elevation, not just inverted?
963
+
964
+
TYPOGRAPHY AS RENDERED:
965
+
- Type scale feels systematic or random?
966
+
- Line lengths readable (45-75 chars)?
967
+
- Font pairing: complementary or competing?
968
+
- Heading hierarchy visually clear?
969
+
970
+
SPACING & RHYTHM:
971
+
- Grid alignment: everything snaps or drifts?
972
+
- Consistent padding/margin rhythm?
973
+
- Border-radius hierarchy or uniform bubbly?
974
+
- Inner radius = outer radius - gap on nested elements?
975
+
976
+
AI SLOP DETECTION (the 10 anti-patterns — visual check):
"Review the frontend source code in this repo. You are an independent senior product designer doing a source-code design audit. Focus on CONSISTENCY PATTERNS across files rather than individual violations:
964
1067
- Are spacing values systematic across the codebase?
@@ -972,22 +1075,25 @@ For each finding: what's wrong, severity (critical/high/medium), and the file:li
972
1075
-**Auth failure:** If stderr contains "auth", "login", "unauthorized", or "API key": "Codex authentication failed. Run `codex login` to authenticate."
973
1076
-**Timeout:** "Codex timed out after 5 minutes."
974
1077
-**Empty response:** "Codex returned no response."
975
-
- On any Codex error: proceed with Claude subagent output only, tagged `[single-model]`.
976
-
- If Claude subagent also fails: "Outside voices unavailable — continuing with primary review."
1078
+
- On any Codex error: proceed with Gemini and/or Claude subagent output only.
1079
+
- On any Gemini error: proceed with Codex and/or Claude subagent output only.
1080
+
- If all outside voices fail: "Outside voices unavailable — continuing with primary review."
977
1081
1082
+
Present Gemini output under a `GEMINI SAYS (visual audit — pixel-level):` header.
978
1083
Present Codex output under a `CODEX SAYS (design source audit):` header.
979
1084
Present subagent output under a `CLAUDE SUBAGENT (design consistency):` header.
980
1085
981
1086
**Synthesis — Litmus scorecard:**
982
1087
983
-
Use the same scorecard format as /plan-design-review (shown above). Fill in from both outputs.
984
-
Merge findings into the triage with `[codex]` / `[subagent]` / `[cross-model]` tags.
1088
+
Use the same scorecard format as /plan-design-review (shown above). Fill in from all outputs.
1089
+
Gemini's visual litmus checks take PRIORITY for visual dimensions (hierarchy, composition, color, spacing) because it evaluates the rendered output, not source code.
1090
+
Merge findings into the triage with `[gemini-visual]` / `[codex]` / `[subagent]` / `[cross-model]` tags.
Replace STATUS with "clean" or "issues_found", SOURCE with "codex+subagent", "codex-only", "subagent-only", or "unavailable".
1096
+
Replace STATUS with "clean" or "issues_found", SOURCE with "gemini+codex+subagent", "gemini+subagent", "gemini+codex", "codex+subagent", "gemini-only", "codex-only", "subagent-only", or "unavailable".
Copy file name to clipboardExpand all lines: plan-ceo-review/SKILL.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1240,7 +1240,7 @@ Parse the output. Find the most recent entry for each skill (plan-ceo-review, pl
1240
1240
-**CEO Review (optional):** Use your judgment. Recommend it for big product/business changes, new user-facing features, or scope decisions. Skip for bug fixes, refactors, infra, and cleanup.
1241
1241
-**Design Review (optional):** Use your judgment. Recommend it for UI/UX changes. Skip for backend-only, infra, or prompt-only changes.
1242
1242
-**Adversarial Review (automatic):** Auto-scales by diff size. Small diffs (<50 lines) skip adversarial. Medium diffs (50–199) get cross-model adversarial. Large diffs (200+) get all 4 passes: Claude structured, Codex structured, Claude adversarial subagent, Codex adversarial. No configuration needed.
1243
-
-**Outside Voice (optional):** Independent plan review from a different AI model. Offered after all review sections complete in /plan-ceo-review and /plan-eng-review. Falls back to Claude subagent if Codex is unavailable. Never gates shipping.
1243
+
-**Outside Voice (optional):** Independent plan review from a different AI model. Offered after all review sections complete in /plan-ceo-review and /plan-eng-review. Fallback chain: Codex → Gemini (gemini-3.1-pro-preview) → Claude subagent. Never gates shipping.
1244
1244
1245
1245
**Verdict logic:**
1246
1246
-**CLEARED**: Eng Review has >= 1 entry within 7 days from either \`review\` or \`plan-eng-review\` with status "clean" (or \`skip_eng_review\` is \`true\`)
0 commit comments