AnalysisAI ModelsVisual AI
7 days ago
Query-based cross-modal projector bolsters Mamba multimodal LLM
Proposes a query-based cross-modal projector to enhance Mamba-based multimodal large language models, addressing Transformer quadratic complexity. Aims to improve multimodal performance while reducing computational load.
·
7 days ago