AnalysisAI Models
7 days ago
FindIt benchmark for multimodal LLMs on visual detection
FindIt is a format-informed visual detection benchmark for generalist multimodal LLMs. It evaluates models on structured tasks like object detection and layout analysis.
·
7 days ago