Objective
Visually verify the Studio comparison matrix UI (PR #984) renders correctly in a browser. The API has been verified but the React component has not been tested in an actual browser.
Context
PR #984 adds a "Compare" tab to the Studio with an experiment × target matrix. The backend API returns correct data (verified via curl), but the React component (CompareTab.tsx) has not been tested in a browser because the dev environment lacks a display server for headless Chrome.
What to verify
- Start studio:
agentv studio --multi
- Register
agentv-bench-skills project (has experiment data)
- Navigate to the project → "Compare" tab
- Verify:
- Matrix renders with experiments as columns, targets as rows
- Cells are color-coded: green (>80%), yellow (50-80%), red (<50%)
- Best/worst performer indicators (▲/▼) appear
- Clicking a cell expands to show per-test-case breakdown
- Empty state renders correctly for projects with no experiments
- Error state renders when API fails
- Loading skeleton shows while data loads
Acceptance criteria
Objective
Visually verify the Studio comparison matrix UI (PR #984) renders correctly in a browser. The API has been verified but the React component has not been tested in an actual browser.
Context
PR #984 adds a "Compare" tab to the Studio with an experiment × target matrix. The backend API returns correct data (verified via curl), but the React component (
CompareTab.tsx) has not been tested in a browser because the dev environment lacks a display server for headless Chrome.What to verify
agentv studio --multiagentv-bench-skillsproject (has experiment data)Acceptance criteria