AnalysisAI Models
23 hours ago
New benchmark PuMVR reveals script bias in multilingual VLMs
PuMVR (Punjabi Multimodal Visual Reasoning) tests VLMs on Punjabi text in Gurmukhi and Shahmukhi scripts, revealing significant accuracy drops across scripts for the same language. The benchmark challenges the one-to-one language-script mapping assumption in multilingual VLM evaluation.