4 hours ago
Cartesia runs real-time voice AI on Together infrastructure

Together AI
@togethercompute
Accelerate inference, model shaping, and pre-training on a research-optimized platform.
San Francisco, CA
together.ai

Together AI
@togethercompute
.@cartesia runs one of the hardest inference workloads: real-time voice. Their stack has to keep long-lived streams moving, serve millions of audio minutes a day, and hold model latency around 90ms. Together gives them the managed GPU infrastructure and low-level cluster