AnalysisAI Models
8 days ago
Locality does not imply reachability in block-sparse attention
Paper identifies a mismatch between sequence locality and attention-graph reachability in block-sparse causal attention. Authors propose a boundary repair mechanism to address the issue.
·
8 days ago