HN Reader
Top
New
Best
Ask
Show
Jobs
LLM Speedrunner: Eval for frontier models to reproduce scientific findings
1
zerojames
0
6/27/2025, 12:34:35 PM
github.com ↗
Comments (0)
No comments yet