Analysis · AI Models
6 days ago
CL-Bench: New benchmark for continual learning in stateful environments
CL-Bench is the first difficult benchmark for evaluating continual learning in AI systems. It requires models to adapt to sequential experiences and tests frontier models in real-world stateful settings.