AnalysisAI Models
28 days ago
New LLM architectures: KV sharing, compressed attention in Gemma 4, DeepSeek V4
Analysis covers KV sharing and compressed attention in recent open-weight LLMs like Gemma 4 and DeepSeek V4. Focuses on reducing long-context costs for reasoning and agent workflows.
28 days ago
