LLM-D: Kubernetes-Native Distributed Inference at Scale

10 bbzjk7 2 5/20/2025, 11:55:18 PM github.com ↗

Comments (2)

xianshou · 10h ago
Kemschumam · 9h ago
What would be the benefit of this project over hosting VLLM in Ray?