Back to AIBriefs
LaunchAI Models

DiffusionGemma: 4x faster text generation

DiffusionGemma is a 26B MoE model (3.8B active) that generates text via diffusion, achieving up to 4x speedup over autoregressive Gemma 4 models (1000+ tokens/s on H100, 700+ on RTX 5090). Released under Apache 2.0, it is designed for speed-critical local workflows like in-line editing and code infilling.

DiffusionGemma: 4x faster text generation — AIBriefs