Problems in LLM Benchmarking and Evaluation
13 points by acegod | 4 comments | 8/15/2025, 3:28:52 PM | xent.tech
The best way to measure intelligence is probably to test whether a model knows its own strengths and weaknesses and can work around them efficiently. That ability is the most important thing for an eval to capture.