AnalysisAI ModelsVisual AI
7 days ago
Video2LoRA: Parametric video internalization for VLMs
Method reduces video token usage in vision-language models by internalizing video into LoRA parameters via a perceiver network. Achieves comparable performance to full-frame methods while using fewer tokens.
·
7 days ago