Fix 21 SEVERE+HIGH correctness errors (surgical, audit-driven) by elementalsouls · Pull Request #33 · elementalsouls/Claude-BugHunter

elementalsouls · 2026-06-07T15:16:08Z

Second accuracy pass from the multi-agent audit: the 21 confirmed correctness errors in otherwise-adequate/strong skills (the weakest-15 were #32). Same pipeline — surgical fix → adversarial verify → hand-correct — but tuned to touch only the confirmed error (diffs are deliberately tiny: 182+/132− across 21 files). 14/21 came back verifier-clean; 3 were flagged and hand-corrected.

🔴 SEVERE — fabrications / wrong identifiers

Skill	Fix
`okta-attack`	fabricated `CVE-2024-VERIFY` → real `CVE-2024-10327` (Okta Verify iOS push bypass, verified on NVD)
`enterprise-vpn-attack`	`CVE-2024-46805` → `CVE-2023-46805` (Ivanti, the real number)
`supply-chain-attack-recon`	SUNBURST no longer mislabeled `CVE-2020-10148`; cite CISA AA20-352A
`meme-coin-audit`	removed invented "35%/25%/20%" stats + phantom tool refs
`hunt-subdomain`	removed an unverifiable HackerOne id (`#1487793`) — kept the technique
`bugcrowd-reporting`	fixed backwards chain-submission UUID ordering (§5.1 and §8.1)
`hunt-business-logic`	corrected the "HMAC replay with modified payload bypasses" claim

🟠 HIGH — stale facts / backwards logic / class conflations (14)

hunt-xss (removed-2019 XSS Auditor), hunt-xxe (.NET XmlReader default), triage-validation (CVSS vectors recomputed), hunt-ssti/hunt-springboot (backwards #{7*7}), hunt-open-redirect (CRLF≠open-redirect), hunt-saml (gzip/comment-injection + de-overclaimed description), hunt-cache-poison, hunt-mfa-bypass, hunt-nextjs, redteam-mindset, vmware-vcenter-attack, mid-engagement-ir-detection, hunt-sharepoint.

What the adversarial verifier caught (and I hand-fixed)

bugcrowd-reporting — §5.1 fixed but §8.1 left contradicting it → updated §8.1 too
meme-coin-audit — fixer introduced "Flashbots SUAVE as live 2026 infra" (it's testnet) → removed
hunt-saml — needs-rework: core fixes good, but description still over-claimed XSW1-XSW8 / "real paid examples" → trimmed to match the body

Anti-fabrication

Every added identifier is a well-known real CVE or a real advisory (verified the two riskiest — CVE-2024-10327, CVE-2023-46805 — and removed the one unverifiable HackerOne id rather than keep it).

Gates: all 71 lint-clean (incl. the YAML-safety check), descriptions ≤1024, bodies ≤500.

Second accuracy pass from the multi-agent audit — the confirmed correctness errors in otherwise-adequate/strong skills (the weakest 15 were PR #32). Same pipeline: surgical fix → adversarial verify → hand-correct. 14/21 verifier-clean; 3 flagged (bugcrowd §8.1 stale ordering, meme-coin SUAVE overstatement, hunt-saml partial) all hand-corrected. Diffs are intentionally small (182+/132−). SEVERE — fabrications / wrong identifiers: - okta-attack: fabricated `CVE-2024-VERIFY` -> real CVE-2024-10327 (Okta Verify iOS push-notification bypass; web-verified on NVD) - enterprise-vpn-attack: `CVE-2024-46805` -> CVE-2023-46805 (Ivanti, the real number) - supply-chain-attack-recon: SUNBURST no longer mislabeled CVE-2020-10148; cite CISA AA20-352A - meme-coin-audit: removed invented "35%/25%/20%" stats + phantom tool refs; SUAVE de-overstated - hunt-subdomain: removed an UNVERIFIABLE HackerOne report id (#1487793) — kept the technique - bugcrowd-reporting: fixed backwards chain-submission UUID ordering (§5.1 AND §8.1) - hunt-business-logic: corrected the "HMAC replay with modified payload bypasses" claim HIGH — stale facts / backwards logic / class conflations: - hunt-xss (removed-2019 Chrome XSS Auditor), hunt-xxe (.NET XmlReader DtdProcessing default), triage-validation (CVSS vectors recomputed), hunt-ssti/hunt-springboot (backwards `#{7*7}` logic), hunt-open-redirect (CRLF != open-redirect), hunt-saml (gzip/comment-injection corrected; description de-overclaimed), hunt-cache-poison, hunt-mfa-bypass, hunt-nextjs, redteam-mindset, vmware-vcenter-attack, mid-engagement-ir-detection, hunt-sharepoint. Anti-fabrication enforced: every added identifier is a well-known real CVE (Ivanti 2023-46805, Next.js 2024-34351, SharePoint ToolShell 2025-49704/49706, ruby-saml 2017-11428, Okta 2024-10327) or a real advisory (CISA AA20-352A); the one unverifiable HackerOne id was removed, not kept. All 71 lint clean (incl. the new YAML-safety check), descriptions <=1024, bodies <=500. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

elementalsouls merged commit 5dfa9f4 into main Jun 7, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix 21 SEVERE+HIGH correctness errors (surgical, audit-driven)#33

Fix 21 SEVERE+HIGH correctness errors (surgical, audit-driven)#33
elementalsouls merged 1 commit into
mainfrom
fix-accuracy-severe-high

elementalsouls commented Jun 7, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

elementalsouls commented Jun 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔴 SEVERE — fabrications / wrong identifiers

🟠 HIGH — stale facts / backwards logic / class conflations (14)

What the adversarial verifier caught (and I hand-fixed)

Anti-fabrication

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

elementalsouls commented Jun 7, 2026 •

edited

Loading