Google Gemma Gains Speculative Decoding

Mountain View — Google's Gemma 4 hits 3x faster output. A 74-million-parameter drafter proposes tokens that Gemma 4 verifies in parallel, cutting latency on consumer hardware, Google said. Gemma 4 shi

Ars Technica