Ask HN: What is so good about MCP servers?

12 points by metadat 2h ago 9 comments

Tell HN: Online Safety Act to be enforced in the UK on July 25th

7 points by trycatchthroawy 4h ago 2 comments

Ask HN: Why didn't people 40 years ago worry about population collapse today?

5 points by amichail 4h ago 12 comments

Ask HN: Help me navigate a PIP at a remote startup in the Netherlands

17 points by msoad 15h ago 13 comments

Mineral exploration startups are the tech startups of the physical world

4 points by unicorn_chaser 8h ago 0 comments

Ask HN: How did you navigate an illegal termination?

5 points by infoseekadvice 10h ago 4 comments

Ask HN: How do you avoid AI slop on YouTube?

4 points by npteljes 7h ago 6 comments

Remove All AI Features from Firefox

43 points by nabla9 1d ago 7 comments

Ask HN: Why do Cursor, Windsurf and Claude Code dominate the conversation?

23 points by bluelightning2k 3d ago 35 comments

Google admits system failure but claims "technical impossibility" to fix it

4 points by CBlakeley 6h ago 1 comments

Ask HN: Is anybody using llama.cpp for production?

10 points by HardikVala 23h ago 1 comments

Ask HN: Has anyone deployed LLMs to production?

12 points by saaspirant 1d ago 5 comments

Ask HN: Python developers at big companies what is your setup?

32 points by ravshan 2d ago 32 comments

Ask HN: Why is Gmail so incompetent at basic search?

57 points by sn9 3d ago 57 comments

Ask HN: How many of you are working in tech without a STEM degree?

15 points by zebproj 1d ago 21 comments

I'm Peter Roberts, immigration attorney who does work for YC and startups. AMA

162 points by proberts 6d ago 265 comments

Ask HN: Copilot Makes Me Dumb

4 points by ynarwal__ 1d ago 6 comments

MatrixTransformer: Structural Pattern Discovery Without Training

3 points by AyodeleFikayomi 1d ago 0 comments

Ask HN: What Speaker Diarization tools should I look into?

11 points by justforfunhere 2d ago 8 comments

Ask HN: Any active COBOL devs here? What are you working on?

242 points by _false 6d ago 186 comments

Most interesting job openings according to ChatGPT

4 points by jobswithgptcom 1d ago 3 comments

Ask HN: What's Your Useful Local LLM Stack?

91 points by Olshansky 9d ago 52 comments

Ask HN: What Pocket alternatives did you move to?

125 points by ahmedfromtunis 7d ago 143 comments

Ask HN: How to find non-popular blogs and forums?

25 points by dominicq 3d ago 20 comments

Ask HN: What NAS (Synology) do you use in your home lab as a developer?

11 points by lavren1974 1d ago 10 comments

Ask HN: How have you optimized your company/ work?

10 points by Xx_crazy420_xX 2d ago 7 comments

Getting 5 to 10 spam calls / voicemails - what changed?

8 points by tlogan 2d ago 10 comments

Instant responsiveness in user interfaces is annoying

23 points by zero-sharp 3d ago 28 comments

KeePassXC two factor authentification suddenly fails everywhere

3 points by nilslindemann 1d ago 3 comments

Ask HN: Has anyone deployed LLMs to production?

12 saaspirant 5 7/24/2025, 2:16:40 AM

I have been trying to tune Gemini flash to do some classification for me and it's not performing well at all. I had to change a lot of prompts and still it didn't seem to "learn" anything from the training set. The classification embarrassingly lacks common sense.

Has anyone used AI for anything useful? Apart from programming of course.

Comments (5)

muzani · 1d ago

They're great at first level customer service. Lots of questions are repetitive and they go through this better than humans. It was the biggest boost to customer satisfaction rating.

On the other end, I actually canceled a $100/month subscription once through email (it was company email that I no longer had access too). Gave evidence. It canceled the subscription within 20 mins.

Also gemini flash is unreliable. The best cost efficiency today seems to be gpt-4.1. The cheaper models seem to be okay for summarization mostly. Gemini Flash was much better a year ago, still unreliable, but at least it followed instructions.

byoung2 · 23h ago

I was having trouble getting GPT-4o to extract data like address, email, phone, tracking number from random emails in an inbox. Sometimes it would do it perfectly and other times it would fail miserably on a similar email. Then I tried asking it to first markup the email with schema.org metadata. Then I asked it to extract the data from the schema.org markup. That worked nearly every time.

Maybe there is an extra step you can work into your prompt that would help it get to the proper classification

nkristoffersen · 21h ago

We are using over 50 billion LLM tokens for NLP/classification purposes per month. A mix of self hosted and cloud hosted models. But I have not attempted any fine tuning. Just prompt, (and perhaps more importantly) context “engineering”.

mooreds · 23h ago

We use it heavily for doc search. We bought Kapa.ai a few years ago and leverage their solution, not an in-house build.

incomingpain · 15h ago

I have Microsoft's Phi4 deployed onto https://mapleintel.ca for the AI side. Currently over 44,000 ips in that list.

I tried 'reasoning plus' but it was so much slower.