HN Reader
Show HN: Run MMLU benchmark on any LLM endpoint
2
lostmsu
0
5/2/2025, 11:38:15 PM
mmlu.borgcloud.ai
I built it to quickly compare LLM hosting providers and weed out the ones serving poorly quantized models.
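For anyone curious what such a comparison boils down to, here is a minimal sketch in Python: ask each endpoint the same multiple-choice questions and compare accuracy. This is not the site's actual implementation. The `Question` type, the prompt layout, and the `ask` callable (standing in for whatever client calls your endpoint) are all hypothetical, and the sketch assumes the model replies with the answer letter first.

```python
from typing import Callable, Iterable, NamedTuple, Optional, Sequence


class Question(NamedTuple):
    prompt: str              # question text
    choices: Sequence[str]   # four answer options, in A-D order
    answer: str              # correct letter, "A".."D"


def format_prompt(q: Question) -> str:
    """Render a question as a multiple-choice prompt (hypothetical layout)."""
    lines = [q.prompt]
    lines += [f"{letter}. {text}" for letter, text in zip("ABCD", q.choices)]
    lines.append("Answer:")
    return "\n".join(lines)


def extract_letter(reply: str) -> Optional[str]:
    """Return the reply's first character if it is A-D, else None."""
    first = reply.strip()[:1].upper()
    return first if first and first in "ABCD" else None


def accuracy(questions: Iterable[Question], ask: Callable[[str], str]) -> float:
    """Fraction of questions answered correctly by `ask`.

    `ask` is an assumed stand-in: it takes a prompt string and returns
    the model's text reply. Unparseable replies count as wrong.
    """
    total = correct = 0
    for q in questions:
        total += 1
        if extract_letter(ask(format_prompt(q))) == q.answer:
            correct += 1
    return correct / total if total else 0.0
```

Running `accuracy` with the same questions against two providers that claim to host the same model gives two numbers to compare; a noticeably lower score on one side is a hint, not proof, that it is serving a more aggressive quantization.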
Comments (0)
No comments yet