AnalysisAI Models
8 days ago
TGV-KV: Text-Grounded KV Eviction for VLMs
The paper introduces TGV-KV, a text-grounded KV eviction method for VLMs that reduces memory consumption by selectively removing redundant tokens. It leverages cross-modal attention scores and achieves up to 50% cache reduction with minimal performance degradation. Evaluated on multiple benchmarks, TGV-KV demonstrates efficiency gains.
·
8 days ago