> “If you look at the models before they are fine-tuned on human preferences, they’re surprisingly well calibrated. So if you ask the model for its confidence in an answer—that confidence correlates really well with whether or not the model is telling the truth—we then train them on human preferences and undo this.”
Now that is really interesting! I didn't realize RLHF did that.
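(For anyone wondering what "well calibrated" means in practice, here's a minimal sketch, assuming you already have the model's stated confidence and a right/wrong label for each answer; it's just expected calibration error, not anything from the article.)

    # Hypothetical illustration: calibration from (confidence, correct) pairs.
    # A well-calibrated model's stated confidence matches its empirical accuracy.
    import numpy as np

    def expected_calibration_error(confidences, correct, n_bins=10):
        confidences = np.asarray(confidences, dtype=float)
        correct = np.asarray(correct, dtype=float)
        edges = np.linspace(0.0, 1.0, n_bins + 1)
        ece = 0.0
        for lo, hi in zip(edges[:-1], edges[1:]):
            in_bin = (confidences > lo) & (confidences <= hi)
            if in_bin.any():
                gap = abs(correct[in_bin].mean() - confidences[in_bin].mean())
                ece += in_bin.mean() * gap  # weight by the fraction of samples in this bin
        return ece

    # A model that says "90% sure" and is right about 90% of the time scores near 0.
    print(expected_calibration_error([0.9, 0.8, 0.95, 0.6], [1, 1, 1, 0]))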
davesmylie · 3h ago
Obviously it's been well over a year since this article was posted, and if anything I've anecdotally noticed hallucinations getting more, not less, common.
Possibly/probably with another year's experience with LLMs I'm just more attuned to noticing when they've lost the plot and are making shit up.
DoctorOetker · 24m ago
The article argues that these brainfarts could be beneficial for exploring new ideas.
I don't agree. The "temperature" parameter should be used for this. Confabulation / bluff / hallucination / unfounded guesses are undesirable at low temperatures.
ipv6ipv4 · 3h ago
There is ample evidence that hallucinations are incurable in the best extant model of intelligence: people.
add-sub-mul-div · 2h ago
Someday we'll figure out how to program computers to behave deterministically so that they can complement our human abilities rather than badly impersonate them.
alkyon · 2h ago
The more accurate word would be confabulation.
Wowfunhappy · 2h ago
You lost this battle, sorry. It's not going to happen.
Both terms are "inaccurate" because we're talking about a computer program, not a person. However, at this point "hallucination" has been firmly cemented in public discourse. I don't work in tech, but all of my colleagues know what an AI hallucination is, as does my grandmother. It's only a matter of time until the word's alternate meaning gets added to the dictionary.
alkyon · 1h ago
Maybe I lost this battle, but terminology evolves in science too. If you replaced AI hallucination with AI confabulation, even your grandmother would get it right. I also don't agree that both terms are equally inaccurate.
peterashford · 1h ago
Correct. This is the way language works. It's annoying when you know what words mean, but that's how it is.
alkyon · 1h ago
Obviously, hallucination is by definition a perception without an external stimulus, so the term incorrectly anthropomorphizes AI models. Confabulation, on the other hand, means filling in gaps with fabrication, which is exactly what LLMs do (aka bullshitting).
more_corn · 1h ago
What an absurd prediction.
techpineapple · 4h ago
I wonder if it would be better to have 1 “perfect” LLM trying to solve problems or 5 intentionally biased LLMs.
d00mB0t · 4h ago
I'm so tired of these rich dweebs pontificating to everyone.