The current state of LLMs (ChatGPT, Gemini) gives the impression of having 'solved digital experience' completely. They are self-contained to the extent that the 2023 technique of building wrappers on top of them to customise experiences seems redundant.

I intuitively sense scope for a meld of such intelligence with the physical world.

Are there startups that are building anything cool in this space?

Comments (2)

gtirloni · 2h ago

That's an interesting question, but the "AI wrappers" aren't going away because LLMs 1) aren't totally deterministic and 2) feeding them the correct prompts and context is still very valuable. In other words, one-shotting doesn't work for every use case (which is essentially what you're saying when you say they are "self-contained", right? Unfortunately, they aren't/can't be).

Regarding the physical world, that's a deeper question. You have people that say LLMs "understand", that they are "intelligent" and that this is an "emergent behavior" of all their weights. You also have people that say they are nothing more than stochastic parrots or auto-complete on steroids.

I'm in neither camp but let's do a thought exercise. Multi-modal LLMs are trained on text, video, and sound. They can know what a chair looks like, what sound it makes if you drag it over a wooden floor, and what it would look like when you do that (from this mysterious PoV somewhere). Now take that "knowledge" and ask it to give you 3D coordinates to move a chair right now in the room you're standing in: it simply can't. It's lacking a lot of information about the actual measurements of the room, its own movement capabilities (or those of the human to carry out the task), etc.

There are AI systems that can do this, but they aren't good for text. We have self-driving cars and factory robots doing things constrained to those domains.

If you say "meld" as in "let's combine a bunch of different AI technologies together with each one doing what it does best", I'm sure people are working on this already. But LLMs are only a small part of solving that problem.

EDIT: if you still can, please add "Ask HN: " to your title here.

ai_critic · 2h ago

What on earth ever gave you that impression?

Anyone melding GPT-level intelligence with physical world?

Comments (2)