FindIt benchmark for multimodal LLMs on visual detection

7 days ago

FindIt is a format-informed visual detection benchmark for generalist multimodal LLMs. It evaluates models on structured tasks such as object detection and layout analysis.
