Launch | AI Models
Jun 10, 4:24 PM
DiffusionGemma: 4x faster text generation
DiffusionGemma is a 26B-parameter mixture-of-experts (MoE) model with 3.8B active parameters that generates text via diffusion. It achieves up to a 4x speedup over autoregressive Gemma 4 models, reaching 1000+ tokens/s on an H100 and 700+ tokens/s on an RTX 5090. Released under Apache 2.0, it is designed for speed-critical local workflows such as in-line editing and code infilling.
