Analysis, Policy
13 days ago
Anthropic details pre-release model red-teaming process

Anthropic
@anthropicai
We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.
anthropic.com

Anthropic
@AnthropicAI
RT @claudeai: Before we ship a new model, these teams try to break it. They build with it, push it to its limits, and tell us where it fal…