New benchmark tests AI's ability to predict post-cutoff discoveries


20 days ago


Opus 4.7 earned 17.75% partial credit on 'The Singularity Gate' benchmark, while every model scored 0% on fully correct answers. The benchmark evaluates whether frontier AI models can anticipate paradigm-breaking scientific findings published after their training cutoff.
