
Cerebras launches Gemma 4 31B at 1,800 tokens per second

Cerebras says Gemma 4 31B runs at more than 1,800 tokens per second on its platform, which the company claims is over 30 times faster than traditional GPU-based providers. The multimodal model from Google DeepMind is now available on the Cerebras platform.

AIBriefs · 4 hours ago