AnalysisAI ModelsAI Agents
1 day ago
GameCraft-Bench evaluates agents on building playable games end-to-end
The benchmark requires coding agents to transform natural-language specifications into playable games within a real game engine. It tests end-to-end game generation, from specification to interactive system, without supervised game data.
