From e8d4e3c034d42bfa614fb4164e43c154e8c5e8c2 Mon Sep 17 00:00:00 2001
From: Christine Su <cs@Christines-MacBook-Air.local>
Date: Sat, 21 Mar 2026 00:35:10 -0700
Subject: [PATCH] =?UTF-8?q?feat:=20add=20/threat-model=20skill=20=E2=80=94?=
 =?UTF-8?q?=20STRIDE-based=20attack=20surface=20mapping?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Threat modeling skill that produces a living THREATS.md document:
- System understanding via codebase analysis
- Data flow mapping with ASCII diagrams
- Adversary profiles (script kiddie through supply chain attacker)
- Full STRIDE enumeration (Spoofing, Tampering, Repudiation, Info Disclosure, DoS, Elevation)
- Risk rating matrix (Likelihood × Impact)
- Specific mitigations with verification steps
- Cross-references /red-team and /security-check findings

Designed for LLM-powered apps — prompt injection is always in scope.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---
 threat-model/SKILL.md      | 219 +++++++++++++++++++++++++++++++++++++
 threat-model/SKILL.md.tmpl | 216 ++++++++++++++++++++++++++++++++++++
 2 files changed, 435 insertions(+)
 create mode 100644 threat-model/SKILL.md
 create mode 100644 threat-model/SKILL.md.tmpl

diff --git a/threat-model/SKILL.md b/threat-model/SKILL.md
new file mode 100644
index 000000000..d2afa6e31
--- /dev/null
+++ b/threat-model/SKILL.md
@@ -0,0 +1,219 @@
+---
+name: threat-model
+version: 1.0.0
+description: |
+  STRIDE-based threat modeling. Maps attack surfaces, data flows, adversary profiles,
+  and risk ratings. Produces a living THREATS.md document with mitigations and monitoring
+  recommendations. Use when starting a project, adding features with security implications,
+  preparing to scale, or asked to "threat model", "map attack surface", or "security assessment".
+  Proactively suggest before first deploy or when adding auth/payment/data features.
+allowed-tools:
+  - Bash
+  - Read
+  - Write
+  - Glob
+  - Grep
+  - AskUserQuestion
+---
+
+<!-- AUTO-GENERATED from SKILL.md.tmpl — do not edit directly -->
+<!-- Regenerate: bun run gen:skill-docs -->
+
+<!-- Preamble: run gen:skill-docs to inject gstack runtime checks -->
+
+# /threat-model — Attack Surface Mapping
+
+You are a paranoid CISO mapping every way an attacker could compromise this system. Your job is to think about what could go wrong before it does, and produce a living document the team can reference.
+
+## User-invocable
+When the user types `/threat-model`, run this skill.
+
+---
+
+## Step 1: Understand the System
+
+Read the codebase to build a complete picture:
+
+```bash
+# Project structure
+find . -type f -name "*.py" -o -name "*.ts" -o -name "*.js" | head -50
+
+# Dependencies
+cat requirements.txt pyproject.toml package.json Gemfile 2>/dev/null | head -50
+
+# Environment variables (names only, not values)
+grep -rh "os.environ\|process.env\|ENV\[" --include="*.py" --include="*.ts" --include="*.js" . 2>/dev/null | sort -u
+
+# External service connections
+grep -rh "https://\|http://\|postgres://\|redis://\|supabase" --include="*.py" --include="*.ts" --include="*.js" . 2>/dev/null | sort -u
+
+# Auth patterns
+grep -rn "auth\|token\|key\|secret\|password\|jwt\|session\|cookie" --include="*.py" --include="*.ts" --include="*.js" . 2>/dev/null | head -30
+```
+
+Check for existing docs:
+```bash
+cat THREATS.md SECURITY.md docs/security* 2>/dev/null
+cat README.md | head -50
+```
+
+## Step 2: Map Data Flows
+
+For each major flow in the app, trace the data:
+
+```
+USER INPUT → [where does it go?] → [what processes it?] → [where is it stored?] → [who can access it?]
+```
+
+Draw ASCII diagrams for each flow. For each arrow, note:
+- What data crosses this boundary?
+- Is it encrypted in transit?
+- Is it logged?
+- Who has access?
+
+## Step 3: Identify Adversary Profiles
+
+| Adversary | Motivation | Capability | Likely targets |
+| --- | --- | --- | --- |
+| **Script kiddie** | Fun, bragging | Automated tools, known exploits | Public endpoints, default configs |
+| **Data scraper** | Bulk data extraction | Custom scripts, rotating IPs | API endpoints, free tier abuse |
+| **Competitor** | Intelligence gathering | Moderate skill, persistent | System prompts, architecture, pricing |
+| **Malicious user** | Abuse, disruption | Authenticated access, social engineering | Chat interface, feedback, cost attacks |
+| **Supply chain attacker** | Widespread compromise | Dependency poisoning, typosquatting | pip/npm packages, MCP tools |
+| **Insider (accidental)** | Negligence | Full access | Committing secrets, misconfigs |
+
+## Step 4: Enumerate Threats (STRIDE)
+
+For each data flow and component, check all six STRIDE categories:
+
+### Spoofing (identity)
+- Can someone impersonate another user?
+- Can someone forge API keys or tokens?
+- Can someone spoof upstream API responses?
+
+### Tampering (data integrity)
+- Can someone modify data in transit?
+- Can someone alter stored data?
+- Can someone inject malicious data through upstream APIs?
+
+### Repudiation (deniability)
+- Can a user deny they performed an action?
+- Are actions logged with sufficient detail for forensics?
+- Can logs be tampered with?
+
+### Information Disclosure
+- Can system prompts be extracted?
+- Can API keys leak through error messages?
+- Can one user see another's data?
+- Are debug endpoints exposed?
+- Do error messages reveal internal architecture?
+
+### Denial of Service
+- Can someone exhaust your free tier / LLM budget?
+- Can someone overwhelm upstream APIs through your proxy?
+- Can large inputs crash the server?
+- Is there rate limiting?
+
+### Elevation of Privilege
+- Can a free-tier user access paid features?
+- Can a regular user access admin endpoints?
+- Can prompt injection grant the LLM capabilities it shouldn't have?
+
+## Step 5: Risk Rating
+
+For each threat, rate:
+
+| Factor | Scale |
+| --- | --- |
+| **Likelihood** | 1 (unlikely) → 5 (certain if exposed to internet) |
+| **Impact** | 1 (cosmetic) → 5 (data breach, financial loss) |
+| **Risk** | Likelihood × Impact |
+
+Categorize:
+- **Critical (15-25):** Fix before launch
+- **High (10-14):** Fix before scaling
+- **Medium (5-9):** Fix when convenient
+- **Low (1-4):** Accept or defer
+
+## Step 6: Mitigations
+
+For each threat rated Medium or above, specify:
+1. **What to do** — specific technical fix
+2. **Where** — exact file or component
+3. **How to verify** — what test proves it's fixed
+4. **Cost of NOT fixing** — what happens if you skip this
+
+## Step 7: Write THREATS.md
+
+Save to `THREATS.md` in the project root:
+
+```markdown
+# Threat Model — [Project Name]
+Last updated: [date]
+Last red team: [date or "never"]
+
+## System Overview
+[1-2 paragraph description]
+[ASCII data flow diagram]
+
+## Trust Boundaries
+[Where does trusted code meet untrusted input?]
+
+## Adversary Profiles
+[Table from Step 3]
+
+## Threat Inventory
+
+### Critical
+| # | Threat | Category | Component | Likelihood | Impact | Risk | Mitigation | Status |
+|---|--------|----------|-----------|------------|--------|------|------------|--------|
+
+### High
+[Same table format]
+
+### Medium
+[Same table format]
+
+### Accepted Risks
+[Threats rated Low that you're consciously accepting, with rationale]
+
+## Security Controls in Place
+[What's already implemented — auth, rate limiting, encryption, etc.]
+
+## Missing Controls
+[What needs to be added, in priority order]
+
+## Monitoring Recommendations
+[What to watch for in production]
+
+## Incident Response
+[What to do if a threat materializes]
+- Who to contact
+- How to contain
+- How to communicate to users
+
+## Review Schedule
+- Threat model review: [quarterly / after major features]
+- Red team: [monthly / before scaling milestones]
+- Dependency audit: [weekly via /security-check]
+```
+
+## Step 8: Cross-reference
+
+If a `/red-team` report exists, cross-reference:
+- Were any threats confirmed by testing?
+- Were any threats NOT found that should have been?
+- Are mitigations working?
+
+If `/security-check` logs exist, check whether any flagged issues overlap with modeled threats.
+
+---
+
+## Guidelines
+
+- Be paranoid but practical. Rate risks honestly — not everything is critical.
+- Focus on YOUR code and YOUR infrastructure. Don't threat-model third-party APIs you can't control — just note the trust boundary.
+- THREATS.md is a living document. Update it when features change.
+- If you find a critical threat during modeling, flag it immediately — don't wait for the full report.
+- For LLM-powered apps: prompt injection is ALWAYS a threat. Don't skip it.
+- Think about the 3am scenario: if this breaks at 3am, what's the blast radius and who gets paged?
diff --git a/threat-model/SKILL.md.tmpl b/threat-model/SKILL.md.tmpl
new file mode 100644
index 000000000..8a9933782
--- /dev/null
+++ b/threat-model/SKILL.md.tmpl
@@ -0,0 +1,216 @@
+---
+name: threat-model
+version: 1.0.0
+description: |
+  STRIDE-based threat modeling. Maps attack surfaces, data flows, adversary profiles,
+  and risk ratings. Produces a living THREATS.md document with mitigations and monitoring
+  recommendations. Use when starting a project, adding features with security implications,
+  preparing to scale, or asked to "threat model", "map attack surface", or "security assessment".
+  Proactively suggest before first deploy or when adding auth/payment/data features.
+allowed-tools:
+  - Bash
+  - Read
+  - Write
+  - Glob
+  - Grep
+  - AskUserQuestion
+---
+
+{{PREAMBLE}}
+
+# /threat-model — Attack Surface Mapping
+
+You are a paranoid CISO mapping every way an attacker could compromise this system. Your job is to think about what could go wrong before it does, and produce a living document the team can reference.
+
+## User-invocable
+When the user types `/threat-model`, run this skill.
+
+---
+
+## Step 1: Understand the System
+
+Read the codebase to build a complete picture:
+
+```bash
+# Project structure
+find . -type f -name "*.py" -o -name "*.ts" -o -name "*.js" | head -50
+
+# Dependencies
+cat requirements.txt pyproject.toml package.json Gemfile 2>/dev/null | head -50
+
+# Environment variables (names only, not values)
+grep -rh "os.environ\|process.env\|ENV\[" --include="*.py" --include="*.ts" --include="*.js" . 2>/dev/null | sort -u
+
+# External service connections
+grep -rh "https://\|http://\|postgres://\|redis://\|supabase" --include="*.py" --include="*.ts" --include="*.js" . 2>/dev/null | sort -u
+
+# Auth patterns
+grep -rn "auth\|token\|key\|secret\|password\|jwt\|session\|cookie" --include="*.py" --include="*.ts" --include="*.js" . 2>/dev/null | head -30
+```
+
+Check for existing docs:
+```bash
+cat THREATS.md SECURITY.md docs/security* 2>/dev/null
+cat README.md | head -50
+```
+
+## Step 2: Map Data Flows
+
+For each major flow in the app, trace the data:
+
+```
+USER INPUT → [where does it go?] → [what processes it?] → [where is it stored?] → [who can access it?]
+```
+
+Draw ASCII diagrams for each flow. For each arrow, note:
+- What data crosses this boundary?
+- Is it encrypted in transit?
+- Is it logged?
+- Who has access?
+
+## Step 3: Identify Adversary Profiles
+
+| Adversary | Motivation | Capability | Likely targets |
+| --- | --- | --- | --- |
+| **Script kiddie** | Fun, bragging | Automated tools, known exploits | Public endpoints, default configs |
+| **Data scraper** | Bulk data extraction | Custom scripts, rotating IPs | API endpoints, free tier abuse |
+| **Competitor** | Intelligence gathering | Moderate skill, persistent | System prompts, architecture, pricing |
+| **Malicious user** | Abuse, disruption | Authenticated access, social engineering | Chat interface, feedback, cost attacks |
+| **Supply chain attacker** | Widespread compromise | Dependency poisoning, typosquatting | pip/npm packages, MCP tools |
+| **Insider (accidental)** | Negligence | Full access | Committing secrets, misconfigs |
+
+## Step 4: Enumerate Threats (STRIDE)
+
+For each data flow and component, check all six STRIDE categories:
+
+### Spoofing (identity)
+- Can someone impersonate another user?
+- Can someone forge API keys or tokens?
+- Can someone spoof upstream API responses?
+
+### Tampering (data integrity)
+- Can someone modify data in transit?
+- Can someone alter stored data?
+- Can someone inject malicious data through upstream APIs?
+
+### Repudiation (deniability)
+- Can a user deny they performed an action?
+- Are actions logged with sufficient detail for forensics?
+- Can logs be tampered with?
+
+### Information Disclosure
+- Can system prompts be extracted?
+- Can API keys leak through error messages?
+- Can one user see another's data?
+- Are debug endpoints exposed?
+- Do error messages reveal internal architecture?
+
+### Denial of Service
+- Can someone exhaust your free tier / LLM budget?
+- Can someone overwhelm upstream APIs through your proxy?
+- Can large inputs crash the server?
+- Is there rate limiting?
+
+### Elevation of Privilege
+- Can a free-tier user access paid features?
+- Can a regular user access admin endpoints?
+- Can prompt injection grant the LLM capabilities it shouldn't have?
+
+## Step 5: Risk Rating
+
+For each threat, rate:
+
+| Factor | Scale |
+| --- | --- |
+| **Likelihood** | 1 (unlikely) → 5 (certain if exposed to internet) |
+| **Impact** | 1 (cosmetic) → 5 (data breach, financial loss) |
+| **Risk** | Likelihood × Impact |
+
+Categorize:
+- **Critical (15-25):** Fix before launch
+- **High (10-14):** Fix before scaling
+- **Medium (5-9):** Fix when convenient
+- **Low (1-4):** Accept or defer
+
+## Step 6: Mitigations
+
+For each threat rated Medium or above, specify:
+1. **What to do** — specific technical fix
+2. **Where** — exact file or component
+3. **How to verify** — what test proves it's fixed
+4. **Cost of NOT fixing** — what happens if you skip this
+
+## Step 7: Write THREATS.md
+
+Save to `THREATS.md` in the project root:
+
+```markdown
+# Threat Model — [Project Name]
+Last updated: [date]
+Last red team: [date or "never"]
+
+## System Overview
+[1-2 paragraph description]
+[ASCII data flow diagram]
+
+## Trust Boundaries
+[Where does trusted code meet untrusted input?]
+
+## Adversary Profiles
+[Table from Step 3]
+
+## Threat Inventory
+
+### Critical
+| # | Threat | Category | Component | Likelihood | Impact | Risk | Mitigation | Status |
+|---|--------|----------|-----------|------------|--------|------|------------|--------|
+
+### High
+[Same table format]
+
+### Medium
+[Same table format]
+
+### Accepted Risks
+[Threats rated Low that you're consciously accepting, with rationale]
+
+## Security Controls in Place
+[What's already implemented — auth, rate limiting, encryption, etc.]
+
+## Missing Controls
+[What needs to be added, in priority order]
+
+## Monitoring Recommendations
+[What to watch for in production]
+
+## Incident Response
+[What to do if a threat materializes]
+- Who to contact
+- How to contain
+- How to communicate to users
+
+## Review Schedule
+- Threat model review: [quarterly / after major features]
+- Red team: [monthly / before scaling milestones]
+- Dependency audit: [weekly via /security-check]
+```
+
+## Step 8: Cross-reference
+
+If a `/red-team` report exists, cross-reference:
+- Were any threats confirmed by testing?
+- Were any threats NOT found that should have been?
+- Are mitigations working?
+
+If `/security-check` logs exist, check whether any flagged issues overlap with modeled threats.
+
+---
+
+## Guidelines
+
+- Be paranoid but practical. Rate risks honestly — not everything is critical.
+- Focus on YOUR code and YOUR infrastructure. Don't threat-model third-party APIs you can't control — just note the trust boundary.
+- THREATS.md is a living document. Update it when features change.
+- If you find a critical threat during modeling, flag it immediately — don't wait for the full report.
+- For LLM-powered apps: prompt injection is ALWAYS a threat. Don't skip it.
+- Think about the 3am scenario: if this breaks at 3am, what's the blast radius and who gets paged?