Back to AIBriefs
LaunchAI ModelsAI Agents
Featured·

TMax: open-source RL recipe for terminal agents

Nathan Lambert avatar
Nathan Lambert
@natolambert.bsky.social

Excited to share a new open-source, RL recipe paper! TMax is the best openly available terminal-bench style training data, establishing the open frontier of small terminal agents with RL training. Many great insights into training in the work led by Hamish Ivison and Oscar Yin.

·
Jun 22, 6:48 PM
TMax: open-source RL recipe for terminal agents — AIBriefs