Analysis, Policy
25 days ago
METR publishes frontier AI risk report with Anthropic, Google, Meta, OpenAI
The pilot exercise assessed the misalignment risks posed by AI agents used within frontier labs. Participants provided access to their most capable models and to non-public information. Key findings cover the harmful actions agents could take, their motives for taking them, and their opportunities to succeed.
