Hacker News
new
|
ask
|
show
|
jobs
Token-Count-Based Batching: Faster, Cheaper Embedding Inference for Queries
(mongodb.com)
1 point
by
fzliu
13 hours ago
|
discuss