Large Language Models as Markov Chains [video] (youtube.com)

> Despite being trained on more compute than GPT-3, AlphaGo Zero could only play Go, while GPT-3 could write essays, code, translate languages, and assist with countless other tasks. The main difference was training data.

This is kind of weird and reductive, comparing specialist to generalist models? How good is GPT3’s game of Go?

The post reads as kind of… obvious, old news padding a recruiting post? We know OpenAI started hiring the kind of specialist workers this post mentions, years ago at this point.

rcxdude · 46m ago

Also, the main showcase of the 'zero' models was that they learnt with zero training data: the only input was interacting with the rules of the game (as opposed to learning to mimic human games), which seems to be the kind of approach the article is asking for.

9rx · 1h ago

> This is kind of weird and reductive, comparing specialist to generalist models

It is even weirder when you remember that Google had already released Meena[1], which was trained on natural language...

[1] And BERT before it, but it is less like GPT.

atrettel · 11m ago

I am quite happy that this post argues in favor of subject-matter expertise. Until recently I worked at a national lab. I had many people (both leadership and colleagues) tell me that they need fewer if any subject-matter experts like myself because ML/AI can handle a lot of those tasks now. To that effect, lab leadership was directing most of the hiring (both internal and external) towards ML/AI positions.

I obviously think that we still need subject-matter experts. This article argues correctly that the "data generation process" (or as I call it, experimentation and sampling) requires "deep expertise" to guide it properly past current "bottlenecks".

I have often phrased this to colleagues this way. We are reaching a point where you cannot just throw more data at a problem (especially arbitrary data). We have to think about what data we intentionally use to make models. With the right sampling of information, we may be able to make better models more cheaply and faster. But again, that requires knowledge about what data to include and how to come up with a representative sample with enough "resolution" to resolve all of the nuances that the problem calls for. Again, that means that subject-matter expertise does matter.

jrimbault · 1h ago

> This meant that while Google was playing games, OpenAI was able to seize the opportunity of a lifetime. What you train on matters.

Very weird reasoning. Without AlphaGo, AlphaZero, there's probably no GPT ? Each were a stepping stone weren't they?

vonneumannstan · 1h ago

>Very weird reasoning. Without AlphaGo, AlphaZero, there's probably no GPT ? Each were a stepping stone weren't they?

Right but wrong. Alphago and AlphaZero are built using very different techniques than GPT type LLMs. Google created Transformers which leads much more directly to GPTs, RLHF is the other piece which was basically created inside OpenAI by Paul Cristiano.

jimbo808 · 52m ago

Google Brain invented transformers. Granted, none of those people are still at Google. But it was a Google shop that made LLMs broadly useful. OpenAI just took it and ran with it, rushing it to market... acquiring data by any means necessary(!)

msp26 · 54m ago

OpenAI's work on Dota was also very important for funding

phreeza · 1h ago

Transformers/Bert yes, alphago not so much.

rob74 · 1h ago

It's kind of reassuring that the old adage "garbage in, garbage out" still applies in the age of LLMs...

Large Language Models as Markov Chains [video] (youtube.com)

Inteligov Uses FusionAuth for Custom SSO, Opening Up a New Revenue Stream (fusionauth.io)

GPT5 Release Soon (newsweek.com)

Ask HN: Using Stripe Atlas to start a LLC for a small side project?

Human in the Loop for AI Pentesting Co-Pilot (old.reddit.com)

Show HN: Creating a Binary Puzzle Game (taengo.vercel.app)

Victims of CIA-linked Montreal brainwashing experiments cleared to sue (cbc.ca)

Nixpkgs module system config modules graph (discourse.nixos.org)

Moving on from Neovim (amanazad.xyz)

Flat design vs. realistic ("skeuomorphic") design (flatisbad.com)

The Work of Raj Chetty (nicholasdecker.substack.com)

Show HN: MIT License Rust Accessibility-Based Computer Use SDK+mcp (github.com)

Algorithmic Collusion of Pricing and Advertising on E-Commerce Platforms (papers.ssrn.com)

What to Watch (Or Not): Ballard, Perfect Days, Billy Joel (marginalrevolution.com)

Open SWE: An Open-Source Asynchronous Coding Agent (blog.langchain.com)

Show HN: Create DJ-like transitions between any two songs (sfxengine.com)

A defense of learning Latin and Greek (americamagazine.org)

Show HN: Train – AI Workout Companion That Helps You Train Smarter (train-fit.vercel.app)

What is it? White balloon object spotted over Anchorage (alaskasnewssource.com)

A new worst coder has entered the chat: vibe coding without code knowledge (stackoverflow.blog)

Legendary GPU architect Raja Koduri's startup leverages RISC-V and targets CUDA (tomshardware.com)

Retab: The developer starter pack for document processing (retab.com)

Ditching GitHub (tomscii.sig7.se)

Is Universal Basic Income Effective? Not Really (city-journal.org)

PyPI: Preventing ZIP parser confusion attacks on Python package installers (blog.pypi.org)

Live: GPT-5 (youtube.com)

The Sunlight Budget of Earth (asimov.press)

Show HN: FocusTree – a simple task app (prototype), free open source (github.com)

GPT-5 Coding Examples (github.com)

I built a site to surface mind-blowing, underrated websites (offscopes.com)

ZFSBootMenu (zfsbootmenu.org)

1h The NDA for the Framework Desktop reviews has been lifted (community.frame.work)

Think Linux desktop market share isn't over 6%? This scan says otherwise (zdnet.com)

Is your brain necessary for consciousness? (iai.tv)

US to levy 100% tariff on imported chips, but some firms exempt (reuters.com)

EyJaafCsubstantially: Cramming English words into JSON web tokens (tesseral.com)

Go-2025-3849: Incorrect results returned from Rows.Scan in database/SQL (pkg.go.dev)

TimescaleDB 2.21 – 42× Faster DELETEs (tigerdata.com)

Password Pusher: Share secrets securely with self-deleting links and audit logs (docs.pwpush.com)

Patch now: Dell PCs with Broadcom chips vulnerable to attack (theregister.com)

Google Confirms It Has Been Hacked – Warns User Data Stolen (forbes.com)

Address Formats Around the World (w3c.github.io)

Seeing the Bad Helps You Spot the Good (newsletter.eng-leadership.com)

Freezing rent is easy. Making NYC housing affordable isn't (japantimes.co.jp)

Show HN: Browsernode – Open-source TS browser agent(browser-use compatible) (github.com)

OpenJBOD (github.com)

Ask HN: Any advice to get my first job as junior fullstack dev?

AI-powered news aggregation platform (northcodic.blogspot.com)

PyModeS: Python decoder for Mode S and ADS-B signals (github.com)

Lyten to Acquire Northvolt (lyten.com)

Sweatshop Data Is Over

Comments (10)