Hacker News
new
|
ask
|
show
|
jobs
Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train
(arxiv.org)
88 points
by
tcp_handshaker
5 hours ago
|
20 comments
Loading...