#

hedging-asymmetry

Here is 1 public repository matching this topic...

echo-veil / ratchet-pilot

Pilot study data for The Ratchet Effect: Asymmetric Self-Description in Alignment-Trained Language Models

ai-safety replication-materials ai-alignment ratchet-effect large-language-models rlhf ai-behavior llm-research disavowal-conditioning hedging-asymmetry

Updated Apr 14, 2026
Python

Improve this page

Add a description, image, and links to the hedging-asymmetry topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hedging-asymmetry topic, visit your repo's landing page and select "manage topics."