This is a cool concept, but I can’t help wishing there were more comparison between the treatment group and a control group that doesn’t see any universal pretraining data.

It’s good that the paper compares various model sizes, evaluation tasks, and random data generators. I just think it would prove its point more effectively if it showed that models of the same size that see this random data learn better from evaluation data later on than models that don’t.

Could even compare the initial checkpoint of the model, before universal pretraining, against the pretrained checkpoint. If the method works, the one that did UP will win.

Maybe I’m way off, I’ll admit I only skimmed it so far. Seems promising, just wishing for some controls.

liamdgray · 4h ago

Abstract: "We investigate the use of randomly generated data for the sake of pre-training a model. We justify this approach theoretically from the perspective of algorithmic complexity, building on recent research that shows that sequence models can be trained to approximate Solomonoff induction. We derive similar, but complementary theoretical results. We show empirically that synthetically generated data can be used to pre-train a model before the data is seen. We replicate earlier results that models trained this way show zero-shot in-context learning across a variety of datasets, and that this performance improves with scale. We extend earlier results to real-world data, and show that finetuning a model after pre-training offers faster convergence and better generalization."

Universal pre-training by iterated random computation

Comments (2)