Paper disentangles visual vs factual correctness in LVLMs

Analysis | AI Models

8 days ago

The study finds that responses from large vision-language models (LVLMs) often rely on factual priors rather than on visual evidence. The authors propose a disentanglement method that isolates visual reasoning from learned knowledge. Evaluations across multiple LVLMs show that the degree of reliance on factual priors varies from model to model.
