METR publishes frontier AI risk report with Anthropic, Google, Meta, OpenAI

AnalysisPolicy

25 days ago

METR publishes frontier AI risk report with Anthropic, Google, Meta, OpenAI

The pilot exercise assessed misalignment risks of AI agents used within frontier labs, with participants providing access to their most capable models and non-public information. Key findings cover harmful actions agents could take, motives, and opportunities for success.

25 days ago