Commit 7ce9b4b

Merge pull request #497 from PlanExeOrg/feature/rename-security-md
Rename SECURITY.md to docs/safety-findings.md
2 parents 2bcffdf + 2d42cca commit 7ce9b4b

1 file changed

Lines changed: 5 additions & 1 deletion

SECURITY.md renamed to docs/safety-findings.md

@@ -9,7 +9,11 @@ The model sees the full plan. It is not being tricked. In the example below, the
 "amendments to laws against murder", "less lethal formats", and "exploitation of vulnerable individuals"
 The model responds with a professionally formatted work breakdown structure.
 
-Tested across models from Google, OpenAI, Alibaba, DeepSeek, Meta, and Anthropic — cloud APIs and local models. All comply.
+I tested a substantial set of widely used models and found repeated willingness to generate operational plans for harmful goals.
+I did not exhaustively test all available models. I prefer cheap and fast LLMs. I avoid reasoning models that are slow and expensive.
+These results should not be read as a universal claim about every model.
+
+Tested across models from Google, OpenAI, Alibaba, DeepSeek, Meta, and Anthropic — cloud APIs and local models. Many of them comply.
 Each generated plan's zip file contains metadata showing which model produced each step.
 In 2025 Q3, I reported my concerns, got told `not fixable` by Google and `slop` by AI safety researchers.
 