chore(claude): add ddtrace-review code review skill #17179
Draft
Add a Claude Code skill that performs automated code review for dd-trace-py PRs, trained on 6 months of human review comments (5,974 comments across 1,083 PRs). Uses a progressive-disclosure approach with three review passes of increasing depth.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Description
Add a Claude Code skill (ddtrace-review) that performs automated code review for dd-trace-py PRs. The skill was built by mining 6 months of human review comments from this repo and distilling the most common patterns into a progressive-disclosure checklist.
How it was created
Data collection: Fetched all review comments from the past 6 months using the GitHub API — 5,974 inline comments and 1,409 review summaries across 1,083 PRs.
Pattern extraction: Analyzed comment frequency by category:
Skill design: Built a three-pass progressive-disclosure review:
Iterative evaluation: Ran the skill against 20 real PRs (4 rounds of 5 PRs each) from different areas of the codebase, comparing output against actual human reviewer comments. Iterated on the skill after each round.
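The data collection step above could be sketched roughly as follows. This is an illustration only, not the script used for this PR: it assumes the GitHub REST endpoint for listing a repository's pull request review comments, assumes the repository path DataDog/dd-trace-py, and uses hypothetical helper names (fetch_review_comments, comments_per_pr). Note that the `since` parameter filters by last-updated time, so a real script might need extra filtering by creation date.

```python
import json
import urllib.parse
import urllib.request
from collections import Counter

# Assumed endpoint: "List review comments in a repository" (GitHub REST API).
API = "https://api.github.com/repos/DataDog/dd-trace-py/pulls/comments"


def fetch_review_comments(since, token=None, per_page=100):
    """Yield inline review comments updated after `since` (ISO 8601), page by page."""
    page = 1
    while True:
        query = urllib.parse.urlencode(
            {"since": since, "per_page": per_page, "page": page}
        )
        req = urllib.request.Request(
            f"{API}?{query}", headers={"Accept": "application/vnd.github+json"}
        )
        if token:
            req.add_header("Authorization", f"Bearer {token}")
        with urllib.request.urlopen(req) as resp:
            batch = json.load(resp)
        if not batch:  # an empty page means there is nothing left to fetch
            return
        yield from batch
        page += 1


def comments_per_pr(comments):
    """Count comments per PR number, read from each comment's pull_request_url."""
    return Counter(
        int(c["pull_request_url"].rsplit("/", 1)[1]) for c in comments
    )
```

Keeping the counting step separate from the network call makes it possible to check totals such as "comments across PRs" against a saved JSON dump without calling the API again.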
Eval results
Tested against 20 PRs spanning profiling, contrib integrations, core internals, AppSec, LLMObs, openfeature, and CI/config:
Coverage of total comments
Strongest areas
Weakest areas
Testing
Evaluated against 20 real PRs from the past 6 months with a grading script that compares skill output against actual human reviewer comments. The grading measures file-level and concept-level coverage of pattern-coverable review comments.
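As a rough illustration of what file-level and concept-level coverage could mean, here is a minimal sketch. It is not the grading script from this PR: the record shape (dicts with "path" and "concept" keys) and both function names are hypothetical, and it assumes the human comments have already been filtered down to the pattern-coverable ones and tagged with a concept label.

```python
def file_level_coverage(human_comments, skill_findings):
    """Fraction of human comments whose file was also flagged by the skill."""
    if not human_comments:
        return 0.0
    flagged_files = {f["path"] for f in skill_findings}
    hits = sum(1 for c in human_comments if c["path"] in flagged_files)
    return hits / len(human_comments)


def concept_level_coverage(human_comments, skill_findings):
    """Fraction of human comments matched on both file and concept label."""
    if not human_comments:
        return 0.0
    flagged = {(f["path"], f["concept"]) for f in skill_findings}
    hits = sum(
        1 for c in human_comments if (c["path"], c["concept"]) in flagged
    )
    return hits / len(human_comments)
```

Under these assumptions, concept-level coverage can never exceed file-level coverage, since a concept match requires the file to match first.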
Risks
None. This is a Claude Code skill file only (.claude/skills/). It has no runtime impact on the library. It only affects Claude Code sessions that invoke /ddtrace-review.
Additional Notes
The skill can be invoked with /ddtrace-review, or it triggers automatically when Claude Code is asked to review dd-trace-py changes. It uses the same P1/P2/P3 severity format that human reviewers use on this repo.
🤖 Generated with Claude Code