AnalysisAI Models
8 days ago
Paper disentangles visual vs factual correctness in LVLMs
The study reveals that LVLMs' responses often rely on factual priors rather than visual evidence. The authors propose a disentanglement method to isolate visual reasoning from learned knowledge. Evaluations across multiple LVLMs show varying degrees of reliance on factual priors.
·
8 days ago