Hacker News
VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
(arxiv.org)
325 points by timhigins 16 hours ago | 171 comments