Ask HN: Why don't LLMs replace bosses instead of engineers?

12 points by fzeindl 4h ago 12 comments

Why do dev tools crush it on Product Hunt but never seem to raise money?

3 points by alexandratabone 5h ago 3 comments

Ask HN: How can ChatGPT serve 700M users when I can't run one GPT-4 locally?

544 points by superasn 3d ago 358 comments

Gemini's brutal assessment of a vibe coding session

7 points by adampwells 10h ago 1 comments

Ask HN: What toolchains are people using for desktop app development in 2025?

101 points by lincoln20xx 2d ago 112 comments

ASK: Could AI Replace the Slush Pile Intern?

2 points by richardatlarge 11h ago 7 comments

We'll need a universal basic income (UBI) in an AI-driven world

17 points by mockingloris 20h ago 38 comments

Ask HN: What trick of the trade took you too long to learn?

381 points by unsupp0rted 7d ago 667 comments

Snapchat open source cross-platform mobile framework. Looking for beta testers

12 points by FactoryReboot 18h ago 1 comments

Ask HN: Why Is My Happiness Tied to My Productivity?

14 points by hnquestion12345 14h ago 18 comments

Ask HN: With all the AI hype, how are software engineers feeling?

89 points by cpt100 1d ago 181 comments

Ask HN: Do you do anything with the "cool" languages that get posted here?

3 points by AstroJetson 17h ago 2 comments

GitHub Outage?

10 points by U1F984 18h ago 5 comments

Tell HN: Regulations.gov Comments API is shutting down on Friday

8 points by sadmiralakbar 19h ago 1 comments

Tell HN: Anthropic expires paid credits after a year

272 points by maytc 7d ago 136 comments

Google's RCS disconnected in several countries

6 points by hocuspocus 20h ago 0 comments

Ask HN: Has any of the Pivotal Tracker replacement attempts succeeded?

47 points by admissionsguy 8d ago 35 comments

What's your favorite CLI tool for integrating LLMs into your terminal workflow?

10 points by menisadi 2d ago 7 comments

Ask HN: Has anyone built anything useful using AI?

5 points by bapak 1d ago 16 comments

Ask HN: What tech skill gave you the biggest boost in your career?

6 points by doppelgunner 23h ago 10 comments

Ask HN: Canadian founders, how do you build in SF?

8 points by changisaac 2d ago 3 comments

Vectorless: open-source PDF chatbot without RAG

4 points by richardmeng 1d ago 4 comments

Ask HN: Advice for someone who wants to try AI-assisted coding?

8 points by inglor_cz 1d ago 19 comments

Ask HN: What are some comfy/stress-free jobs a SWE can do? (LCOL country)

7 points by ejlanor 1d ago 16 comments

Ask HN: What do you dislike about ChatGPT and what needs improving?

33 points by zyruh 5d ago 125 comments

Does anyone know a detailed residential cost estimator

3 points by morpheos137 1d ago 0 comments

Ask HN: Why is Usenet not coming back?

15 points by Fabeltjeskrant 1d ago 15 comments

Ask HN: Best way to get a land line for my kids?

5 points by xrd 1d ago 24 comments

Comparing 6M Feature Selection Methods on Credit Risk Data

2 points by Cermank 20h ago 4 comments

ChatGPT 5 is slow and no better than 4

60 points by iwontberude 2d ago 44 comments

Ask HN: What's Going on with AI Psychosis?

9 points by hhh 1d ago 2 comments

Ask HN: How would you build second brain in the AI era?

9 points by divan 3d ago 5 comments

Feature Request: "Copy" Button Should Copy Only Main Output

3 points by vezycash 2d ago 2 comments

GPT5 is worse than 4.1-mini for text and worse than Sonnet 4 for coding

9 points by hitradostava 2d ago 16 comments

Ask HN: In which programming language is it better to make your own language?

8 points by Forgret 2d ago 19 comments

Tell HN: Charles Irby has passed away

32 points by steven123 4d ago 4 comments

ChatGPT-5 Can't Do Basic Math

16 points by MarcellusDrum 3d ago 16 comments

GPT-5 streaming requires submission of biometric data

35 points by binarymax 3d ago 7 comments

Ask HN: Are you running local LLMs? What are your key use cases?

15 points by briansun 3d ago 13 comments

LLM Evals Are Just Tests. Why Are We Making This So Complicated?

3 camwest 2 8/10/2025, 3:23:48 AM cameronwestland.com ↗

Comments (2)

8organicbits · 2d ago

So, did the tests allow you to build a system that never confused existing features with new features? That seems like the problem statement, but I think I'm only seeing probabilistic testing.

camwest · 1d ago

Never? No. Way less likely? Yes!

In dev we do 100 consistency checks and get green. In CI we do 10.