Launch · AI Models
4 hours ago
Cerebras launches Gemma 4 31B at 1,800 tokens per second
Cerebras claims Gemma 4 31B runs at over 1,800 tokens per second, more than 30 times faster than traditional GPU providers. The multimodal model from Google DeepMind is now available on the Cerebras platform.