AnalysisAI Models
Jun 12, 2:55 PM
MiniMax Sparse Attention aims to cut LLM context cost
MSA targets ultra-long-context for agentic workflows and code reasoning, reducing quadratic softmax attention cost. No benchmarks or comparisons disclosed.
·
Jun 12, 2:55 PM
