Give Me FP32 or Give Me Death?

3 spindump8930 2 6/25/2025, 3:34:21 PM arxiv.org ↗

Comments (2)

spindump8930 · 10h ago
People often don't understand why LLMs can be nondeterministic even with deterministic seeding, temperature, and sampling. This paper shows how bad it can get across different hardware and GPU hosts.
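One well-known contributor, offered here as general background rather than as something quoted from the paper: floating-point addition is not associative, so a reduction that runs in a different order on different hardware or kernels can round differently. A minimal Python sketch, with made-up values and a hypothetical helper `sum_in_order`, assuming standard 64-bit floats:

```python
# Illustrative only: the values and helper name are assumptions,
# not taken from the linked paper.

def sum_in_order(values):
    """Accumulate left to right, like a sequential reduction."""
    total = 0.0
    for v in values:
        total += v
    return total

# Same three numbers, two different accumulation orders.
values = [1e17, 1.0, -1e17]      # the 1.0 is absorbed by 1e17 and lost
reordered = [1e17, -1e17, 1.0]   # the large terms cancel first, 1.0 survives

a = sum_in_order(values)
b = sum_in_order(reordered)
print(a, b, a == b)  # 0.0 1.0 False
```

In lower-precision formats the same effect shows up at much smaller magnitudes, and a tiny difference in a logit can flip which token gets picked, after which the outputs diverge.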
incomingpain · 10h ago
Q4 is plenty for me, I don't have the budget for FP32 lol.

If money wasn't a thing, I'd probably not be going above Q8.