LaunchDevelopers
2 days ago
Cognition unveils FrontierCode, a coding benchmark focused on code quality
FrontierCode tasks each required 40+ hours from open-source maintainers. The benchmark measures whether code would actually be merged, addressing findings that over half of SWEBench results are unmergeable slop.
2 days ago
