AnalysisAI Models
8 days ago
Multi-agent debate can hurt data cleaning, study finds
Across 3 benchmarks, 4 model families, and over 6,000 task-condition pairs, multi-agent debate was found to degrade generation in data cleaning tasks. The paper proposes a fix to the harmful effects.
·
8 days ago