AnalysisAI Models
5 days ago
Featured
Jędrzej Maczan presents Online Softmax talk
Cohere publishes a technical talk on the online softmax algorithm, which computes softmax in a single pass to improve efficiency. The talk covers the safe softmax trick, a proof by induction, and parallelization techniques for ML practitioners.
·
5 days ago