Show HN: sllm – Split a GPU node with other developers, unlimited tokens

Show HN: sllm – Split a GPU node with other developers, unlimited tokens (sllm.cloud)

112 points by jrandolf 9 hours ago | 63 comments

Running DeepSeek V3 (685B) requires 8×H100 GPUs which is about $14k/month. Most developers only need 15-25 tok/s. sllm lets you join a cohort of developers sharing a dedicated node. You reserve a spot with your card, and nobody is charged until the cohort fills. Prices start at $5/mo for smaller models.

The LLMs are completely private (we don't log any traffic).

The API is OpenAI-compatible (we run vLLM), so you just swap the base URL. Currently offering a few models.