AnalysisAI ModelsDevelopers
7 hours ago
Dual DGX Sparks run Deepseek V4 Flash at 40 tk/s (1M context)
A Reddit user reports 40 tok/s on a single 1M context and 350 tok/s aggregated running Deepseek V4 Flash on two Nvidia DGX Sparks. The setup builds on community optimization work.
·
7 hours ago
