AnalysisAI Models
6 days ago
UltraVR benchmark evaluates VLMs on ultra-resolution image VQA
The benchmark tests vision-language models on ultra-resolution images where critical evidence is tiny, subtle, or distributed. It aims to expose limitations in current models on high-resolution, evidence-grounded reasoning tasks.
·
6 days ago