AnalysisAI Models
Jun 14, 5:03 PM
DeepSeek V4 Pro tops Artificial Analysis speed/latency charts on Together AI

Together AI
@togethercomputeAccelerate inference, model shaping, and pre-training on a research-optimized platform.
San Francisco, CAtogether.ai

Together AI
@togethercompute
DeepSeek V4 Pro on Together AI is now #1 on Artificial Analysis for both output speed and latency. Serving V4 well is an inference systems problem: KV cache, prefix reuse, kernels, and endpoint profiles. We break down the systems work here: https://t.co/RLHi35DFif

·
Jun 14, 5:03 PM