Hacker News
new
|
ask
|
show
|
jobs
Accelerating Gemma 4: faster inference with multi-token prediction drafters
(blog.google)
584 points
by
amrrs
19 hours ago
|
273 comments
Loading...