AnalysisAI Models
20 days ago
New benchmark tests AI's ability to predict post-cutoff discoveries
Opus 4.7 scored 17.75% partial credit on 'The Singularity Gate' benchmark; all models got 0% fully correct. The benchmark evaluates frontier AI's capacity to anticipate paradigm-breaking scientific findings published after training cutoff.
·
20 days ago
