Skip to content

Fix the 15 weakest skills (audit-driven, fabrication-checked)#32

Merged
elementalsouls merged 1 commit into
mainfrom
fix-weakest-skills
Jun 7, 2026
Merged

Fix the 15 weakest skills (audit-driven, fabrication-checked)#32
elementalsouls merged 1 commit into
mainfrom
fix-weakest-skills

Conversation

@elementalsouls

Copy link
Copy Markdown
Owner

Rewrites the 15 weakest skills identified by the multi-agent quality audit (71 skills scored on a 6-dimension rubric, weak/error flags adversarially re-checked). These scored 2.17–2.50/5, almost all with grounding=1 (no real citations), thin validation, and confirmed correctness bugs.

Process (audit → fix → adversarial verify → hand-correct)

  1. Fix — one expert agent per skill, with hard no-fabrication rules.
  2. Verify — a separate adversarial agent hunted each fixed file for invented CVEs/citations and new errors. 7/15 came back clean; 8 were flagged — exactly what the pass is for.
  3. Hand-correct — I fixed all 8 flags myself and independently checked the citations.

Confirmed correctness bugs fixed (sample)

Skill Bug fixed
hunt-k8s broken kubelet /exec (SPDY stream, not plain POST) → /run primitive
hunt-cicd cat $GITHUB_TOKEN (env var, not a file)
hunt-grpc h2load≠Rapid-Reset; wrong tool flags + wrong cloudflare/quiche URL
hunt-cors wrong regex-bypass example
hunt-websocket wrong socket.io nsp= syntax; removed mischaracterized CVE-2017-16031
hunt-dom broken cross-origin Service-Worker payload; corrected DOM-clobbering example
hunt-tls-network CAA mischaracterization; replaced dead openssl -ssl2/-ssl3 flags
hunt-ldap {SSHA} hashcat mode 1411111
hunt-lfi misleading php://input; added filter-chain RCE; fixed invented "Gaztech" credit

Anti-fabrication (the #1 risk, given the audit's findings)

  • Added only well-known, verifiable CVEs (Heartbleed/POODLE/DROWN/FREAK/SWEET32, Apache 2.4.49/.50, runc Leaky Vessels, jQuery 11022/11023, Jenkins CVE-2024-23897, …).
  • Removed two HackerOne report IDs (#226659, #281575) that I could not independently verify (HackerOne pages are JS SPAs — unverifiable via fetch). No invented report IDs, payouts, or statistics.

Quality gates

All 15: lint clean (0 errors), descriptions ≤1024, bodies ≤500 lines, YAML-safe (strict/Codex parsers).

Note: independent of PR #31 (multi-harness) — different files, both branch off main.

🤖 Generated with Claude Code

The multi-agent quality audit (71 skills, 6-dimension rubric, adversarially
re-checked) flagged these 15 as weakest (2.17–2.50/5), almost all sharing:
grounding=1 (no real citations), thin validation, and confirmed correctness bugs.

Process: audit → expert fixer per skill (HARD no-fabrication rules) → adversarial
verifier hunting each fixed file for invented CVEs/citations/new errors → manual
correction of every flag. 7/15 came back verifier-clean; the other 8 had issues
the verifier caught (this is the point of the pass) — all hand-corrected here.

Confirmed correctness bugs fixed (examples):
- hunt-k8s: broken kubelet /exec (SPDY stream, not plain POST) → /run primitive
- hunt-cicd: `cat $GITHUB_TOKEN` (env var, not a file) corrected
- hunt-grpc: h2load is not a Rapid-Reset PoC; fixed tool + wrong CLI flags + quiche URL
- hunt-cors: wrong regex-bypass example corrected
- hunt-websocket: wrong socket.io `nsp=` syntax; removed mischaracterized CVE-2017-16031
- hunt-dom: broken cross-origin Service-Worker payload; corrected DOM-clobbering example
- hunt-tls-network: CAA mischaracterization; replaced dead `openssl -ssl2/-ssl3` flags
- hunt-ldap: `{SSHA}` hashcat mode 1411→111 (1411 is {SSHA256})
- hunt-lfi: misleading php://input; added filter-chain RCE; fixed invented "Gaztech" credit

Grounding: added only well-known, verifiable CVEs (Heartbleed/POODLE/DROWN/FREAK/SWEET32,
Apache 2.4.49/.50 path traversal, runc Leaky Vessels, jQuery 11022/11023, Jenkins
CVE-2024-23897, etc.). Anti-fabrication enforced: REMOVED two HackerOne report IDs
(#226659, #281575) that could not be independently verified — no invented report
IDs, payouts, or stats. Validation/false-positive discipline strengthened throughout.

All 15: lint clean, descriptions ≤1024, bodies ≤500 lines, YAML-safe (strict/Codex).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@elementalsouls elementalsouls merged commit d2b44ea into main Jun 7, 2026
1 check passed
@elementalsouls elementalsouls deleted the fix-weakest-skills branch June 7, 2026 08:53
pull Bot pushed a commit to danielabelski/Claude-BugHunter that referenced this pull request Jun 7, 2026
Second accuracy pass from the multi-agent audit — the confirmed correctness
errors in otherwise-adequate/strong skills (the weakest 15 were PR elementalsouls#32). Same
pipeline: surgical fix → adversarial verify → hand-correct. 14/21 verifier-clean;
3 flagged (bugcrowd §8.1 stale ordering, meme-coin SUAVE overstatement, hunt-saml
partial) all hand-corrected. Diffs are intentionally small (182+/132−).

SEVERE — fabrications / wrong identifiers:
- okta-attack: fabricated `CVE-2024-VERIFY` -> real CVE-2024-10327 (Okta Verify iOS
  push-notification bypass; web-verified on NVD)
- enterprise-vpn-attack: `CVE-2024-46805` -> CVE-2023-46805 (Ivanti, the real number)
- supply-chain-attack-recon: SUNBURST no longer mislabeled CVE-2020-10148; cite CISA AA20-352A
- meme-coin-audit: removed invented "35%/25%/20%" stats + phantom tool refs; SUAVE de-overstated
- hunt-subdomain: removed an UNVERIFIABLE HackerOne report id (#1487793) — kept the technique
- bugcrowd-reporting: fixed backwards chain-submission UUID ordering (§5.1 AND §8.1)
- hunt-business-logic: corrected the "HMAC replay with modified payload bypasses" claim

HIGH — stale facts / backwards logic / class conflations:
- hunt-xss (removed-2019 Chrome XSS Auditor), hunt-xxe (.NET XmlReader DtdProcessing
  default), triage-validation (CVSS vectors recomputed), hunt-ssti/hunt-springboot
  (backwards `#{7*7}` logic), hunt-open-redirect (CRLF != open-redirect), hunt-saml
  (gzip/comment-injection corrected; description de-overclaimed), hunt-cache-poison,
  hunt-mfa-bypass, hunt-nextjs, redteam-mindset, vmware-vcenter-attack,
  mid-engagement-ir-detection, hunt-sharepoint.

Anti-fabrication enforced: every added identifier is a well-known real CVE (Ivanti
2023-46805, Next.js 2024-34351, SharePoint ToolShell 2025-49704/49706, ruby-saml
2017-11428, Okta 2024-10327) or a real advisory (CISA AA20-352A); the one
unverifiable HackerOne id was removed, not kept.

All 71 lint clean (incl. the new YAML-safety check), descriptions <=1024, bodies <=500.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants