LLM-D: Kubernetes-Native Distributed Inference at Scale
10 bbzjk7 2 5/20/2025, 11:55:18 PM github.com ↗
Comments (2)
xianshou · 11h ago
Duplicate of https://news.ycombinator.com/item?id=44040883
Kemschumam · 11h ago
What would be the benefit of this project over hosting VLLM in Ray?