Back to AIBriefs
AnalysisPolicy

Anthropic's Fable 5 safeguards quietly limit model

Kimmonismus avatar
Kimmonismus
@kimmonismus

Anthropic’s new Fable 5 safeguards are fascinating. When the model is used for frontier LLM development, it apparently does not simply refuse or warn the user. Instead, it quietly limits its own effectiveness through techniques like prompt modification, steering vectors, and

·
Jun 9, 6:40 PM