LaunchDevelopersAI Models
Jun 16, 11:26 PM
Gym-style benchmark for evaluating AI agent skills

Tom Dörr
@tom_doerrFollow for posts about GitHub repos, DSPy, and agents Subscribe for top posts DM to share your AI project (Due to volume of DMs I'll prioritize subscribers)
tom-doerr.github.io/repo_posts

Tom Doerr
@tom_doerr
Gym-style benchmark for evaluating AI agent skills https://t.co/kdaloCtrOw https://t.co/F0JaHiftfw

·
Jun 16, 11:26 PM