Add eval coverage for dotnet-test/test-gap-analysis#829
Conversation
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Skill Coverage Report
|
There was a problem hiding this comment.
Pull request overview
This PR extends the dotnet-test/test-gap-analysis evaluation suite by adding two new scenarios (C# “report quality” and Rust ? error propagation) plus supporting fixtures, to cover previously-uncovered SKILL.md teaching points around prioritization, trivial/generated code handling, call-chain tracing, strengths reporting, and Rust error propagation.
Changes:
- Added Scenario 5 to
eval.yamlwith a new C# Billing fixture (includes trivial members, a generated.g.csfile, and helper call chains) and rubric/assertions targeting report quality. - Added Scenario 6 to
eval.yamlwith a new Rust fixture demonstrating unobserved?error propagation and rubric/assertions to classify it as an Exception/Panic mutation point. - Introduced new fixture projects/files under
fixtures/report-qualityandfixtures/rust-error-propagation.
Show a summary per file
| File | Description |
|---|---|
| tests/dotnet-test/test-gap-analysis/fixtures/rust-error-propagation/src/lib.rs | Rust library + tests to demonstrate unobserved ?-based error propagation. |
| tests/dotnet-test/test-gap-analysis/fixtures/rust-error-propagation/Cargo.toml | Minimal Cargo manifest for the Rust fixture. |
| tests/dotnet-test/test-gap-analysis/fixtures/report-quality/Billing/InvoiceProcessor.g.cs | Auto-generated partial type stub to validate generated-code exclusion. |
| tests/dotnet-test/test-gap-analysis/fixtures/report-quality/Billing/InvoiceProcessor.cs | Billing business-logic fixture used to drive risk-prioritized gap analysis. |
| tests/dotnet-test/test-gap-analysis/fixtures/report-quality/Billing/Billing.csproj | C# project file for the Billing fixture. |
| tests/dotnet-test/test-gap-analysis/fixtures/report-quality/Billing.Tests/InvoiceProcessorTests.cs | MSTest fixture tests intentionally leaving gaps to be reported. |
| tests/dotnet-test/test-gap-analysis/fixtures/report-quality/Billing.Tests/Billing.Tests.csproj | Test project file referencing Billing + MSTest. |
| tests/dotnet-test/test-gap-analysis/eval.yaml | Adds two new evaluation scenarios and associated rubrics/assertions. |
Copilot's findings
Tip
Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
- Files reviewed: 8/8 changed files
- Comments generated: 1
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
|
/evaluate |
Skill Validation Results
[1]
Model: claude-opus-4.6 | Judge: claude-opus-4.6 🔍 Full Results - additional metrics and failure investigation steps
▶ Sessions Visualisation -- interactive replay of all evaluation sessions |
|
✅ Evaluation passed for |
Extends
tests/dotnet-test/test-gap-analysis/eval.yamlto cover eight previously-uncovered SKILL.md teaching points, adding two scenarios with supporting fixtures.Now-covered points
Validation
Common Pitfalls
?operatorWhat changed
report-quality) — new C#Billingfixture with trivial auto-properties/getters, an auto-generatedInvoiceProcessor.g.cs, and private helpers reached via a call chain. Rubric items verify the analysis excludes trivial code, skips generated code, traces call chains, prioritizes by risk, reports strengths alongside gaps, and labels mutation categories.rust-error-propagation) — new Rust fixture using the?operator with no test on the error path. Rubric items verify the?propagation is flagged as an Exception/Panic mutation point.Verification
Measure-SkillCoverage.ps1: 100% (24/24),uncoveredempty, no regressions (was 66.7%).SkillValidator check --plugin ./plugins/dotnet-test: ✅ all checks passed (only pre-existing token-size warnings).Co-authored-by: Copilot 223556219+Copilot@users.noreply.github.com