LLM Speedrunner: Eval for frontier models to reproduce scientific findings

1 zerojames 0 6/27/2025, 12:34:35 PM github.com ↗

Comments (0)

No comments yet