AnalysisAI Models
May 31, 6:27 PM
GPT-5.5 tops DeepSWE benchmark with 70% pass@1

OpenAI Developers
@openaidevsOfficial updates for developers building with Codex & the OpenAI Platform β’ Service status: https://t.co/kZwnwdYYEq
developers.openai.com

OpenAI Developers
@OpenAIDevs
RT @reach_vb: GPT-5.5 is #1 on DeepSWE, a hard long-horizon coding benchmark π₯ 70% pass@1 vs 58% for Claude Opus 4.8. And GPT-5.5 gets thβ¦
Β·
May 31, 6:27 PM