Show HN: Run MMLU benchmark on any LLM endpoint

2 lostmsu 0 5/2/2025, 11:38:15 PM mmlu.borgcloud.ai
I built this to quickly compare LLM hosting providers and weed out the ones serving poorly quantized models.
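For readers wondering what such a comparison involves, here is a minimal sketch of MMLU-style scoring. It is not the code behind mmlu.borgcloud.ai. The names `format_prompt`, `extract_letter`, and `accuracy` are hypothetical, and `ask` stands in for whatever function sends a prompt to a provider's endpoint and returns the reply text.

```python
import re
from typing import Callable, Optional, Sequence, Tuple

LETTERS = "ABCD"

# One item: (question, four answer choices, index of the correct choice).
Item = Tuple[str, Sequence[str], int]


def format_prompt(question: str, choices: Sequence[str]) -> str:
    """Render a four-option multiple-choice question as plain text."""
    lines = [question]
    lines += [f"{letter}. {choice}" for letter, choice in zip(LETTERS, choices)]
    lines.append("Answer with a single letter (A, B, C, or D).")
    return "\n".join(lines)


def extract_letter(reply: str) -> Optional[str]:
    """Return the first standalone capital A-D in the reply, or None.

    Deliberately naive: a reply that starts with the article "A"
    would be misread, so a real harness needs stricter parsing.
    """
    match = re.search(r"\b([ABCD])\b", reply)
    return match.group(1) if match else None


def accuracy(items: Sequence[Item], ask: Callable[[str], str]) -> float:
    """Fraction of items where the model's letter matches the answer key."""
    if not items:
        return 0.0
    correct = 0
    for question, choices, answer_index in items:
        reply = ask(format_prompt(question, choices))
        if extract_letter(reply) == LETTERS[answer_index]:
            correct += 1
    return correct / len(items)
```

Running `accuracy` over the same items with a different `ask` per provider gives directly comparable scores. A provider that scores clearly lower on the same model name would be a candidate for the quantization problems mentioned above.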
