AnalysisAI ModelsAI Agents
6 days ago
Weakly supervised early failure alerting for LLM agents
Paper introduces weakly supervised method for early failure alerting in dialogs and LLM-agent trajectories, using only trajectory-level success/failure labels. The approach handles sparse supervision by leveraging partial trajectory data.
·
6 days ago