Show HN: LLM Benchmarking Suite

2 Dhyaneesh 0 3/27/2025, 9:26:33 AM github.com
A benchmarking suite for evaluating Gemma and other language models on standard benchmarks, including MMLU (Massive Multitask Language Understanding) and GSM8K (Grade School Math 8K).
