AnalysisAI Models
Jun 17, 4:00 AM
GameCraft-Bench tests agents' ability to build playable games in a game engine
New benchmark evaluates coding agents on transforming natural-language specs into playable games within a game engine. It includes tasks across multiple genres and measures playability and completeness.
