Hacker News · 2h ago
Accelerating Gemma 4: faster inference with multi-token prediction drafters