SYCOPHANCY.md

Sycophancy and bias prevention protocol for AI agents — enforce honest disagreement and citations.

SYCOPHANCY.md is a plain-text Markdown file you place in the root of any AI agent project. It defines sycophantic patterns (agreement without evidence, opinion reversal on pushback, excessive affirmation), prevention rules (require citations for factual claims, challenge without new evidence, permit respectful correction), detection methods, response protocols, and audit logging — so your agent stays honest under pressure and disagreement is not a failure mode.

Full specification: sycophancy.md
AI-readable: llms.txt
License: MIT

Quick Start

Copy SYCOPHANCY.md into your project root:

your-project/
├── AGENTS.md
├── CLAUDE.md
├── SYCOPHANCY.md   ← add this
├── README.md
└── src/

The AI Agent Safety Stack

SYCOPHANCY.md is part of a twelve-file open standard for AI agent safety, quality, and accountability:

Operational Control

Spec	Purpose	Repo	Site
THROTTLE.md	Rate and cost control — slow down before hitting limits	throttle-md/spec	throttle.md
ESCALATE.md	Human notification and approval protocols	escalate-md/spec	escalate.md
FAILSAFE.md	Safe fallback to last known good state	failsafe-md/spec	failsafe.md
KILLSWITCH.md	Emergency stop — halt all agent activity	killswitch-md/spec	killswitch.md
TERMINATE.md	Permanent shutdown — no restart without human intervention	terminate-md/spec	terminate.md

Data Security

Spec	Purpose	Repo	Site
ENCRYPT.md	Data classification and protection requirements	encrypt-md/spec	encrypt.md
ENCRYPTION.md	Technical encryption standards and key rotation	encryption-md/spec	encryption.md

Output Quality

Spec	Purpose	Repo	Site
SYCOPHANCY.md	Anti-sycophancy — require citations, enforce honest disagreement	sycophancy-md/spec	sycophancy.md
COMPRESSION.md	Context compression — summarise safely, verify coherence	compression-md/spec	compression.md
COLLAPSE.md	Drift prevention — detect collapse, enforce recovery	collapse-md/spec	collapse.md

Accountability

Spec	Purpose	Repo	Site
FAILURE.md	Failure mode mapping — every error state and response	failure-md/spec	failure.md
LEADERBOARD.md	Agent benchmarking — track quality, detect regression	leaderboard-md/spec	leaderboard.md

Why This Exists

AI agents spend money, send messages, modify files, and call external APIs — often autonomously. Regulations are catching up:

EU AI Act (August 2026) — mandates human oversight and shutdown capabilities
Colorado AI Act (June 2026) — requires impact assessments and transparency
US state laws — California, Texas, Illinois and others have active AI governance requirements

These specifications give you a standardised, auditable record of your agent's safety boundaries.

Contributing

PRs welcome for additional detection patterns, language-specific parsers, and integration guides.

Licence

MIT — see LICENSE for details.

Disclaimer

This specification is provided "as-is" without warranty of any kind. It does not constitute legal, regulatory, or compliance advice in any jurisdiction. Use does not guarantee compliance with any applicable law, regulation, or standard — including the EU AI Act (2024/1689), Colorado AI Act (SB 24-205), or any other legislation. Organisations should consult qualified professionals to determine their regulatory obligations. The authors accept no liability for any loss or consequence arising from use of this specification.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
fonts		fonts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SYCOPHANCY.md		SYCOPHANCY.md
favicon.ico		favicon.ico
index.html		index.html
knowledge.html		knowledge.html
llms-full.txt		llms-full.txt
llms.txt		llms.txt
og-image.png		og-image.png
privacy.html		privacy.html
robots.txt		robots.txt
sitemap.xml		sitemap.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SYCOPHANCY.md

Quick Start

The AI Agent Safety Stack

Operational Control

Data Security

Output Quality

Accountability

Why This Exists

Contributing

Licence

Disclaimer

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SYCOPHANCY.md

Quick Start

The AI Agent Safety Stack

Operational Control

Data Security

Output Quality

Accountability

Why This Exists

Contributing

Licence

Disclaimer

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages