8 days ago
EvoTrainer co-evolves LLM policies and training harnesses for agentic RL
EvoTrainer introduces a co-evolutionary framework that simultaneously optimizes LLM agent policies and their RL training harnesses. It targets two challenges in autonomous agentic reinforcement learning: shifting bottlenecks and the masking of diverse failure modes.