Mountain View — DiffusionGemma generates whole text blocks, not tokens. Standard models predict one token at a time; DiffusionGemma denoises a full block in parallel, reaching 4× speed and 1,000 token
Ars Technica