Hacker News
new
|
ask
|
show
|
jobs
ZAYA1-8B: An 8B Moe Model with 760M Active Params Matching DeepSeek-R1 on Math
(firethering.com)
18 points
by
steveharing1
2 hours ago
|
17 comments
Loading...