AnalysisAI Models
Jun 10, 11:16 PM
GPT-5.5 beats Claude Fable 5 on new Agents' Last Exam benchmark
Researchers from UC Berkeley's RDI launched Agents' Last Exam (ALE), a new benchmark for AI agent capabilities. Initial results show OpenAI's GPT-5.5 outperforming Anthropic's Claude Fable 5.
·
Jun 10, 11:16 PM
