Jun 24, 10:43 PM
Artificial Analysis launches AA-Briefcase agentic benchmark

Artificial Analysis
@artificialanlys
Independent analysis of AI
San Francisco
artificialanalysis.ai

Agentic knowledge work can take frontier models over 20 minutes per task, as measured in AA-Briefcase, our new benchmark.

Last week we released AA-Briefcase, our proprietary agentic knowledge work benchmark testing models on long-horizon tasks built by industry experts. https://t.co/1JkwUuLchJ
