Fix the 15 weakest skills (audit-driven, fabrication-checked) by elementalsouls · Pull Request #32 · elementalsouls/Claude-BugHunter

elementalsouls · 2026-06-07T08:45:00Z

Rewrites the 15 weakest skills identified by the multi-agent quality audit (71 skills scored on a 6-dimension rubric, weak/error flags adversarially re-checked). These scored 2.17–2.50/5, almost all with grounding=1 (no real citations), thin validation, and confirmed correctness bugs.

Process (audit → fix → adversarial verify → hand-correct)

Fix — one expert agent per skill, with hard no-fabrication rules.
Verify — a separate adversarial agent hunted each fixed file for invented CVEs/citations and new errors. 7/15 came back clean; 8 were flagged — exactly what the pass is for.
Hand-correct — I fixed all 8 flags myself and independently checked the citations.

Confirmed correctness bugs fixed (sample)

Skill	Bug fixed
`hunt-k8s`	broken kubelet `/exec` (SPDY stream, not plain POST) → `/run` primitive
`hunt-cicd`	`cat $GITHUB_TOKEN` (env var, not a file)
`hunt-grpc`	h2load≠Rapid-Reset; wrong tool flags + wrong `cloudflare/quiche` URL
`hunt-cors`	wrong regex-bypass example
`hunt-websocket`	wrong socket.io `nsp=` syntax; removed mischaracterized CVE-2017-16031
`hunt-dom`	broken cross-origin Service-Worker payload; corrected DOM-clobbering example
`hunt-tls-network`	CAA mischaracterization; replaced dead `openssl -ssl2/-ssl3` flags
`hunt-ldap`	`{SSHA}` hashcat mode `1411`→`111`
`hunt-lfi`	misleading `php://input`; added filter-chain RCE; fixed invented "Gaztech" credit

Anti-fabrication (the #1 risk, given the audit's findings)

Added only well-known, verifiable CVEs (Heartbleed/POODLE/DROWN/FREAK/SWEET32, Apache 2.4.49/.50, runc Leaky Vessels, jQuery 11022/11023, Jenkins CVE-2024-23897, …).
Removed two HackerOne report IDs (#226659, #281575) that I could not independently verify (HackerOne pages are JS SPAs — unverifiable via fetch). No invented report IDs, payouts, or statistics.

Quality gates

All 15: lint clean (0 errors), descriptions ≤1024, bodies ≤500 lines, YAML-safe (strict/Codex parsers).

Note: independent of PR #31 (multi-harness) — different files, both branch off main.

🤖 Generated with Claude Code

The multi-agent quality audit (71 skills, 6-dimension rubric, adversarially re-checked) flagged these 15 as weakest (2.17–2.50/5), almost all sharing: grounding=1 (no real citations), thin validation, and confirmed correctness bugs. Process: audit → expert fixer per skill (HARD no-fabrication rules) → adversarial verifier hunting each fixed file for invented CVEs/citations/new errors → manual correction of every flag. 7/15 came back verifier-clean; the other 8 had issues the verifier caught (this is the point of the pass) — all hand-corrected here. Confirmed correctness bugs fixed (examples): - hunt-k8s: broken kubelet /exec (SPDY stream, not plain POST) → /run primitive - hunt-cicd: `cat $GITHUB_TOKEN` (env var, not a file) corrected - hunt-grpc: h2load is not a Rapid-Reset PoC; fixed tool + wrong CLI flags + quiche URL - hunt-cors: wrong regex-bypass example corrected - hunt-websocket: wrong socket.io `nsp=` syntax; removed mischaracterized CVE-2017-16031 - hunt-dom: broken cross-origin Service-Worker payload; corrected DOM-clobbering example - hunt-tls-network: CAA mischaracterization; replaced dead `openssl -ssl2/-ssl3` flags - hunt-ldap: `{SSHA}` hashcat mode 1411→111 (1411 is {SSHA256}) - hunt-lfi: misleading php://input; added filter-chain RCE; fixed invented "Gaztech" credit Grounding: added only well-known, verifiable CVEs (Heartbleed/POODLE/DROWN/FREAK/SWEET32, Apache 2.4.49/.50 path traversal, runc Leaky Vessels, jQuery 11022/11023, Jenkins CVE-2024-23897, etc.). Anti-fabrication enforced: REMOVED two HackerOne report IDs (#226659, #281575) that could not be independently verified — no invented report IDs, payouts, or stats. Validation/false-positive discipline strengthened throughout. All 15: lint clean, descriptions ≤1024, bodies ≤500 lines, YAML-safe (strict/Codex). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Second accuracy pass from the multi-agent audit — the confirmed correctness errors in otherwise-adequate/strong skills (the weakest 15 were PR elementalsouls#32). Same pipeline: surgical fix → adversarial verify → hand-correct. 14/21 verifier-clean; 3 flagged (bugcrowd §8.1 stale ordering, meme-coin SUAVE overstatement, hunt-saml partial) all hand-corrected. Diffs are intentionally small (182+/132−). SEVERE — fabrications / wrong identifiers: - okta-attack: fabricated `CVE-2024-VERIFY` -> real CVE-2024-10327 (Okta Verify iOS push-notification bypass; web-verified on NVD) - enterprise-vpn-attack: `CVE-2024-46805` -> CVE-2023-46805 (Ivanti, the real number) - supply-chain-attack-recon: SUNBURST no longer mislabeled CVE-2020-10148; cite CISA AA20-352A - meme-coin-audit: removed invented "35%/25%/20%" stats + phantom tool refs; SUAVE de-overstated - hunt-subdomain: removed an UNVERIFIABLE HackerOne report id (#1487793) — kept the technique - bugcrowd-reporting: fixed backwards chain-submission UUID ordering (§5.1 AND §8.1) - hunt-business-logic: corrected the "HMAC replay with modified payload bypasses" claim HIGH — stale facts / backwards logic / class conflations: - hunt-xss (removed-2019 Chrome XSS Auditor), hunt-xxe (.NET XmlReader DtdProcessing default), triage-validation (CVSS vectors recomputed), hunt-ssti/hunt-springboot (backwards `#{7*7}` logic), hunt-open-redirect (CRLF != open-redirect), hunt-saml (gzip/comment-injection corrected; description de-overclaimed), hunt-cache-poison, hunt-mfa-bypass, hunt-nextjs, redteam-mindset, vmware-vcenter-attack, mid-engagement-ir-detection, hunt-sharepoint. Anti-fabrication enforced: every added identifier is a well-known real CVE (Ivanti 2023-46805, Next.js 2024-34351, SharePoint ToolShell 2025-49704/49706, ruby-saml 2017-11428, Okta 2024-10327) or a real advisory (CISA AA20-352A); the one unverifiable HackerOne id was removed, not kept. All 71 lint clean (incl. the new YAML-safety check), descriptions <=1024, bodies <=500. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

elementalsouls merged commit d2b44ea into main Jun 7, 2026
1 check passed

elementalsouls deleted the fix-weakest-skills branch June 7, 2026 08:53

elementalsouls mentioned this pull request Jun 7, 2026

Fix 21 SEVERE+HIGH correctness errors (surgical, audit-driven) #33

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix the 15 weakest skills (audit-driven, fabrication-checked)#32

Fix the 15 weakest skills (audit-driven, fabrication-checked)#32
elementalsouls merged 1 commit into
mainfrom
fix-weakest-skills

elementalsouls commented Jun 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

elementalsouls commented Jun 7, 2026

Process (audit → fix → adversarial verify → hand-correct)

Confirmed correctness bugs fixed (sample)

Anti-fabrication (the #1 risk, given the audit's findings)

Quality gates

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants