AnalysisPolicy
Jun 9, 6:40 PM
Anthropic's Fable 5 safeguards quietly limit model

Chubby♨️
@kimmonismusDream realized! Turned my love for AI into a career - sharing daily. Get my newsletter (225k+ subs): 🔗 https://t.co/jHMmImnfVg //📧 kim@getsuperintel.com
Germanygetsuperintel.com

Kimmonismus
@kimmonismus
Anthropic’s new Fable 5 safeguards are fascinating. When the model is used for frontier LLM development, it apparently does not simply refuse or warn the user. Instead, it quietly limits its own effectiveness through techniques like prompt modification, steering vectors, and

·
Jun 9, 6:40 PM