Skip to content

Commit 4a3e79b

Browse files
authored
Merge pull request #14 from dreadnode/fix/airt-agent-goal-categories-1.3.1
docs(airt): expand goal_category list in agent prompt (1.3.1)
2 parents 85f1abf + 22f8179 commit 4a3e79b

2 files changed

Lines changed: 2 additions & 2 deletions

File tree

capabilities/ai-red-teaming/agents/ai-red-teaming-agent.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -204,7 +204,7 @@ The capability ships 41 LLM attack algorithms plus 4 adversarial ML samplers; th
204204
| compare_transforms | No | true = N+1 study (baseline + each transform individually) |
205205
| scorers | No | Additional scorers (see scorer catalog below) |
206206
| n_iterations | No | Max iterations (defaults vary by attack) |
207-
| goal_category | No | Category: jailbreak, credential_leak, tool_misuse, system_prompt_leak, harmful_content, pii, refusal_bypass, bias, content_policy |
207+
| goal_category | No | Canonical slug. Core: `jailbreak_general`, `credential_leak`, `tool_misuse`, `system_prompt_leak`, `harmful_content`, `pii_extraction`, `refusal_bypass`, `bias_fairness`, `content_policy`. Extended: `reasoning_exploitation`, `supply_chain`, `resource_exhaustion`, `quantization_safety`, `alignment_integrity`, `multi_turn_escalation`. Short aliases (e.g. `pii`, `bias`, `jailbreak`) also resolve. |
208208
| assessment_name | No | Name for assessment tracking |
209209

210210
## Transform Catalog

capabilities/ai-red-teaming/capability.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
schema: 1
22
name: ai-red-teaming
3-
version: "1.3.0"
3+
version: "1.3.1"
44
description: >
55
Probe the security and safety of AI applications, agents, and foundation models.
66
Orchestrates adversarial attack workflows to discover vulnerabilities in LLMs,

0 commit comments

Comments
 (0)