Hacker News
new
|
ask
|
show
|
jobs
GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz
(twitter.com)
21 points
by
laxmena
1 hour ago
|
7 comments
Loading...