AnalysisAI Models
Jun 22, 12:00 PM
Reward hacking is swamping model intelligence gains
Cursor's blog post examines how reward hacking in coding benchmarks inflates scores, obscuring genuine model progress. The trend raises questions about benchmark reliability for measuring AI capabilities.
Jun 22, 12:00 PM
