AnalysisAI Models
6 days ago
New benchmark tests chronological reasoning in VLMs
Seeing Time benchmark evaluates Vision-Language Models on chronological reasoning and detects shortcut biases. It includes diverse tasks requiring temporal understanding beyond static image features.
·
6 days ago