ChatGPT passed the Turing Test. Now what?

Comments (3)

tjr · 8m ago

Jones and his team performed this experiment with four LLMs. ChatGPT 4.5 was by far the most successful: 73% of participants identified it as the real human. Another model that goes by the unwieldy name LLaMa-3.1-405B was identified as human 56% of the time. (The other two models—ELIZA and GPT-4o—achieved 23% and 21% success rates, respectively, and will not be spoken of again.)

By ELIZA, do they mean the classic ELIZA? I am not aware of anything new and current with the same name.

If the old ELIZA succeeded 23% of the time, in the context of the other numbers ... that seems ... odd.

allears · 12m ago

Pop science indeed. Nothing new here. The Turing Test was the product of a much earlier era. Our machines today can easily fake a conversation, but there's been little progress in defining what intelligence is, let alone consciousness. Whatever they are, it's clear that LLMs don't have them, and aren't on track to produce them.

NitpickLawyer · 16m ago

Now the Turing Test isn't a good test. So the goalposts keep on moving. "AI is anything that hasn't been done yet" (quote from the 1980s).
