NewsUnited States
OpenGemma 4 adds Multi‑Token Prediction (MTP) — speculative decoding can speed generation up to 3×
Gemma 4 adds experimental Multi-Token Prediction (MTP), using speculative decoding to predict multiple tokens and deliver up to 3× faster generation with no reported quality loss.