Skip to content

Fix 21 SEVERE+HIGH correctness errors (surgical, audit-driven)#33

Merged
elementalsouls merged 1 commit into
mainfrom
fix-accuracy-severe-high
Jun 7, 2026
Merged

Fix 21 SEVERE+HIGH correctness errors (surgical, audit-driven)#33
elementalsouls merged 1 commit into
mainfrom
fix-accuracy-severe-high

Conversation

@elementalsouls

@elementalsouls elementalsouls commented Jun 7, 2026

Copy link
Copy Markdown
Owner

Second accuracy pass from the multi-agent audit: the 21 confirmed correctness errors in otherwise-adequate/strong skills (the weakest-15 were #32). Same pipeline — surgical fix → adversarial verify → hand-correct — but tuned to touch only the confirmed error (diffs are deliberately tiny: 182+/132− across 21 files). 14/21 came back verifier-clean; 3 were flagged and hand-corrected.

🔴 SEVERE — fabrications / wrong identifiers

Skill Fix
okta-attack fabricated CVE-2024-VERIFY → real CVE-2024-10327 (Okta Verify iOS push bypass, verified on NVD)
enterprise-vpn-attack CVE-2024-46805CVE-2023-46805 (Ivanti, the real number)
supply-chain-attack-recon SUNBURST no longer mislabeled CVE-2020-10148; cite CISA AA20-352A
meme-coin-audit removed invented "35%/25%/20%" stats + phantom tool refs
hunt-subdomain removed an unverifiable HackerOne id (#1487793) — kept the technique
bugcrowd-reporting fixed backwards chain-submission UUID ordering (§5.1 and §8.1)
hunt-business-logic corrected the "HMAC replay with modified payload bypasses" claim

🟠 HIGH — stale facts / backwards logic / class conflations (14)

hunt-xss (removed-2019 XSS Auditor), hunt-xxe (.NET XmlReader default), triage-validation (CVSS vectors recomputed), hunt-ssti/hunt-springboot (backwards #{7*7}), hunt-open-redirect (CRLF≠open-redirect), hunt-saml (gzip/comment-injection + de-overclaimed description), hunt-cache-poison, hunt-mfa-bypass, hunt-nextjs, redteam-mindset, vmware-vcenter-attack, mid-engagement-ir-detection, hunt-sharepoint.

What the adversarial verifier caught (and I hand-fixed)

  • bugcrowd-reporting — §5.1 fixed but §8.1 left contradicting it → updated §8.1 too
  • meme-coin-audit — fixer introduced "Flashbots SUAVE as live 2026 infra" (it's testnet) → removed
  • hunt-samlneeds-rework: core fixes good, but description still over-claimed XSW1-XSW8 / "real paid examples" → trimmed to match the body

Anti-fabrication

Every added identifier is a well-known real CVE or a real advisory (verified the two riskiest — CVE-2024-10327, CVE-2023-46805 — and removed the one unverifiable HackerOne id rather than keep it).

Gates: all 71 lint-clean (incl. the YAML-safety check), descriptions ≤1024, bodies ≤500.

Second accuracy pass from the multi-agent audit — the confirmed correctness
errors in otherwise-adequate/strong skills (the weakest 15 were PR #32). Same
pipeline: surgical fix → adversarial verify → hand-correct. 14/21 verifier-clean;
3 flagged (bugcrowd §8.1 stale ordering, meme-coin SUAVE overstatement, hunt-saml
partial) all hand-corrected. Diffs are intentionally small (182+/132−).

SEVERE — fabrications / wrong identifiers:
- okta-attack: fabricated `CVE-2024-VERIFY` -> real CVE-2024-10327 (Okta Verify iOS
  push-notification bypass; web-verified on NVD)
- enterprise-vpn-attack: `CVE-2024-46805` -> CVE-2023-46805 (Ivanti, the real number)
- supply-chain-attack-recon: SUNBURST no longer mislabeled CVE-2020-10148; cite CISA AA20-352A
- meme-coin-audit: removed invented "35%/25%/20%" stats + phantom tool refs; SUAVE de-overstated
- hunt-subdomain: removed an UNVERIFIABLE HackerOne report id (#1487793) — kept the technique
- bugcrowd-reporting: fixed backwards chain-submission UUID ordering (§5.1 AND §8.1)
- hunt-business-logic: corrected the "HMAC replay with modified payload bypasses" claim

HIGH — stale facts / backwards logic / class conflations:
- hunt-xss (removed-2019 Chrome XSS Auditor), hunt-xxe (.NET XmlReader DtdProcessing
  default), triage-validation (CVSS vectors recomputed), hunt-ssti/hunt-springboot
  (backwards `#{7*7}` logic), hunt-open-redirect (CRLF != open-redirect), hunt-saml
  (gzip/comment-injection corrected; description de-overclaimed), hunt-cache-poison,
  hunt-mfa-bypass, hunt-nextjs, redteam-mindset, vmware-vcenter-attack,
  mid-engagement-ir-detection, hunt-sharepoint.

Anti-fabrication enforced: every added identifier is a well-known real CVE (Ivanti
2023-46805, Next.js 2024-34351, SharePoint ToolShell 2025-49704/49706, ruby-saml
2017-11428, Okta 2024-10327) or a real advisory (CISA AA20-352A); the one
unverifiable HackerOne id was removed, not kept.

All 71 lint clean (incl. the new YAML-safety check), descriptions <=1024, bodies <=500.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@elementalsouls elementalsouls merged commit 5dfa9f4 into main Jun 7, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants