Back to AIBriefs
AnalysisAI Models

Google's Diffusion Gemma generates 256 tokens at once

The 2-billion parameter model uses image diffusion techniques to produce 256 tokens simultaneously, significantly speeding up local inference. It iteratively denoises masked tokens rather than predicting one at a time, trading some coherence for parallel generation.

Google's Diffusion Gemma generates 256 tokens at once — AIBriefs