AnalysisAI Models
5 hours ago
Featured
Talk explores attention drift in speculative decoding
Research introduces attention drift, a pattern shift in speculative decoding models, and proposes a post-norm fix. Experiments show the fix improves inference quality.
·
5 hours ago