Cerebras launches Gemma 4 31B at 1,800 tokens per second

4 hours ago

Cerebras claims that Gemma 4 31B runs at over 1,800 tokens per second, more than 30 times the speed offered by traditional GPU providers. The multimodal model from Google DeepMind is now available on the Cerebras platform.
