US vs. Google amicus curiae brief of Y Combinator in support of plaintiffs [pdf] (storage.courtlistener.com)

Their point is that the name and logo are clearly drawing from the Metamorphosis of Prime Intellect, with all the potential baggage that comes with it. It's an interesting choice.

iTokio · 6h ago

It’s interesting that it does something useful (training a LLM) without trust and in a decentralized way.

Maybe this could be used as proof of work? To stop wasting computing resources in crypto currencies and get something useful as a byproduct.

_ink_ · 3h ago

I read an argument, that proof of work needs to be useless and wasteful. If it would produce value in itself it would make 51% attacks more economic and thus the currency less secure.

throwanem · 1h ago

Sure. The whole point of "proof of work" is to show (prove) you've lost energy to heat (work). That's what makes it costly and thus an honest signal.

The model breaks where work can be counterfeited (usually impossible) or where energy prices go to zero, which is why "bitcoin colonialism" was briefly a thing last decade. Much of bitcoin's design, this aspect also, is intended to protect against the bare-fanged, red-eyed money weasels it was also designed to attract.

ucha · 1h ago

It needs to not have economic value but it doesn't necessarily need to be useless and wasteful.

Geee · 3h ago

No, this process doesn't produce "proof of work", i.e. verifiable proofs that energy has been used.

No comments yet

fastball · 5h ago

The emphasis is indeed on "without trust" – as far as I can tell this project is unable to verify whether the decentralized training nodes are contributing productively.

Without the ability to validate that training compute is heading in the globally desired direction, it is unlikely you could use it as the foundation of a (sound) cryptocurrency.

mentalgear · 4h ago

The reward model could be used as a validation/reward for the client. Give the same nodes the same inferences to make, and the one with the highest reward (those could be short, or even partially calculated long-term) will also get the "currency" reward.

Philpax · 1m ago

That sounds like it'll lead to human-driven reward hacking [0]?

[0]: https://en.wikipedia.org/wiki/Reward_hacking

k__ · 1h ago

Arweave and Filecoin use PoW algorithms that prove something useful.

mentalgear · 4h ago

That would be indeed a very promising way of FINALLY making cryptocurrency useful!

proof_by_vibes · 5h ago

There could be merit to this. Proofs are generally computationally hard, so it's possible that a currency could be created by quantifying verification.

littlestymaar · 5h ago

> To stop wasting computing resources in crypto currencies and get something useful as a byproduct.

Bitcoin is the only major cryptocurrency that still use proof of work today (others are either using “proof of stakes” or are “Layer 2” chains), and due to its (relative lack of) governance structure, it's very unlikely to ever change.

Thomashuet · 3h ago

Summary: We've use the most complexest, buzzwordiest training infrastructure to increase the performance of our base model by a whopping 0.5% (±1%).

Weryj · 1h ago

But this isn’t about the performance, the infrastructure is the product here.

lonelyasacloud · 3m ago

Indeed, most reliable way to make money in a gold rush is to sell shovels.

3abiton · 6h ago

This is rather exciting! I see the future of Co-op models made by a community of experts on a specific field that would still allow them to be competitive with "AI monopolies". Maybe not all hope is lost!

danielhanchen · 6h ago

I made some GGUFs at https://huggingface.co/unsloth/INTELLECT-2-GGUF

./llama.cpp/llama-cli -hf unsloth/INTELLECT-2-GGUF:Q4_K_XL -ngl 99

Also it's best to read https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-e... on sampling issues for QwQ based models.

Or TLDR, use the below settings:

./llama.cpp/llama-cli -hf unsloth/INTELLECT-2-GGUF:Q4_K_XL -ngl 99 --temp 0.6 --repeat-penalty 1.1 --dry-multiplier 0.5 --min-p 0.00 --top-k 40 --top-p 0.95 --samplers "top_k;top_p;min_p;temperature;dry;typ_p;xtc"

abtinf · 7h ago

Does this have anything to do with The Metamorphosis Of Prime Intellect, or did they just abuse the name and the cover art?

arthurcolle · 7h ago

Prime Intellect is a grabby AI :)

refulgentis · 8h ago

I guess I'm bearish?

It's not that they trained a new model, but they took an existing model and RL'd it a bit?

The scores are very close to QwQ-32B, and at the end:

"Overall, as QwQ-32B was already extensively trained with RL, it was difficult to obtain huge amounts of generalized improvement on benchmarks beyond our improvements on the training dataset. To see stronger improvements, it is likely that better base models such as the now available Qwen3, or higher quality datasets and RL environments are needed."

fabmilo · 8h ago

The interesting delta here is that this proves that we can distribute the training and get a functioning model. The scaling factor is way bigger than datacenters

comex · 6h ago

But does that mean much when the training that produced the original model was not distributed?

refulgentis · 6h ago

The RL, not the training. No?

itchyjunk · 29m ago

RL is still training. Just like pretraining is still training. SFT is also training. This is how I look at it. Models weights are being updated in all cases.

christianqchung · 7h ago

Third party fine tuned open weighted LLMs tend to be good at a handful of benchmarks, but parity or lower on others compared to the original model. There are some exceptions like Nvidia's Nemotron series, but the differences generally are so small as to be imperceptible. Deepseek released finetunes of several Qwen and Llama models alongside R1, and while they were better in some select (mostly math) and coding domains, there were problems resulting from fine tuning that didn't result in them overtaking the original models in usage.

cess11 · 1h ago

Seems that's mostly a byproduct from working on the core business idea, GPU arbitrage.

esafak · 8h ago

How are they ensuring robustness against adversarial responses?

nsingh2 · 8h ago

From the article, seems like TOPLOC:

> based on top of novel components such as TOPLOC, which verifies rollouts from untrusted inference workers

https://github.com/PrimeIntellect-ai/toploc

xmasotto · 5h ago

Can an expert explain how this protects against adversarial actors?

At a glance it looks like something akin to a computing a checksum that's locality sensitive, so it's robust to floating point errors, etc.

What's to stop someone from sending bad data + a matching bad checksum?

yorwba · 3h ago

The validation procedure is described on page 8 of the TOPLOC paper: https://arxiv.org/abs/2501.16007

The checksum is validated by redoing the computation, but making use of the fact that you already have the entire response to enable greater parallelism than when generating it one token at a time.

Mougatine · 23m ago

very cool work!

schneehertz · 7h ago

I used to have an idea related to science fiction novels that artificial intelligence could aggregate computing power through the network to perform ultra-large-scale calculations, thereby achieving strong artificial intelligence. Reality will also develop in this way, which is very interesting

mountainriver · 8h ago

Awesome work this team is doing. Globally distributed MoE could have real legs

ndgold · 8h ago

Pretty badass

quantumwoke · 8h ago

Wonder what the privacy story is like. Enterprises don't usually like broadcasting their private data across a freely accessible network.

bjt12345 · 7h ago

A strong use case here for quantum-safe encryption.

jumploops · 8h ago

Congrats to the team on the launch!

Personal story time: I met a couple of their engineers at an event a few months back. They mentioned they were building a distributed training system for LLMs.

I asked them how they were building it and they mentioned Python. I said something along the lines of “not to be the typical internet commenter guy, but why aren’t you using something like Rust for the distributed system parts?”

They mumbled something about Python as the base for all current LLMs, and then kinda just walked away…

From their article: > “Rust-based orchestrator and discovery service coordinate permissionless workers”

Glad to see that I wasn’t entirely off-base :)

Havoc · 4h ago

Given the latencies at play python did probably make more sense though

I'd rather read the prompt (claytonwramsey.com)

From: Steve Jobs. "Great idea, thank you." (blog.hayman.net)

The curse of knowing how, or; fixing everything (notashelf.dev)

Show HN: Clippy – 90s UI for local LLMs (felixrieseberg.github.io)

Plain Vanilla Web (plainvanillaweb.com)

Void: Open-source Cursor alternative (github.com)

Ty: A fast Python type checker and language server (github.com)

Design for 3D-Printing (blog.rahix.de)

Zed: High-performance AI Code Editor (zed.dev)

Gemini 2.5 Pro Preview (developers.googleblog.com)

The Death of Daydreaming (afterbabel.com)

OpenAI reaches agreement to buy Windsurf for $3B (bloomberg.com)

My new deadline: 20 years to give away virtually all my wealth (gatesnotes.com)

Claude's system prompt is over 24k tokens with tools (github.com)

ALICE detects the conversion of lead into gold at the LHC (home.cern)

CLion Is Now Free for Non-Commercial Use (blog.jetbrains.com)

Matt Godbolt sold me on Rust by showing me C++ (collabora.com)

Evolving OpenAI's Structure (openai.com)

First American pope elected and will be known as Pope Leo XIV (cnn.com)

A critical look at MCP (raz.sh)

LegoGPT: Generating Physically Stable and Buildable Lego (avalovelace1.github.io)

Waiting for Postgres 18: Accelerating Disk Reads with Asynchronous I/O (pganalyze.com)

Business books are entertainment, not strategic tools (theorthagonist.substack.com)

NSF faces shake-up as officials abolish its 37 divisions (science.org)

Unity’s Open-Source Double Standard: the ban of VLC (mfkl.github.io)

Vision Now Available in Llama.cpp (github.com)

Reservoir Sampling (samwho.dev)

Show HN: Real-time AI Voice Chat at ~500ms Latency (github.com)

Ask HN: What are good high-information density UIs (screenshots, apps, sites)?

Leaving Google (airs.com)

Mistral ships Le Chat – enterprise AI assistant that can run on prem (mistral.ai)

India launches attack on 9 sites in Pakistan and Pakistani Jammu and Kashmir (reuters.com)

High tariffs become 'real' with our first $36K bill (blog.adafruit.com)

One-Click RCE in Asus's Preinstalled Driver Software (mrbruh.com)

A simple 16x16 dot animation from simple math rules (tixy.land)

Curl: We still have not seen a valid security report done with AI help (linkedin.com)

VVVVVV Source Code (github.com)

So Much Blood (dynomight.net)

Replacing Kubernetes with systemd (2024) (blog.yaakov.online)

The vocal effects of Daft Punk (bjango.com)

Observations from people-watching (skincontact.substack.com)

Rust’s dependencies are starting to worry me (vincents.dev)

Sneakers (1992) – 4K makeover sourced from the original camera negative (blu-ray.com)

Launch HN: Exa (YC S21) – The web as a database

US vs. Google amicus curiae brief of Y Combinator in support of plaintiffs [pdf] (storage.courtlistener.com)

An appeal to Apple from Anukari (anukari.com)

Judge said Meta illegally used books to build its AI (wired.com)

As an experienced LLM user, I don't use generative LLMs often (minimaxir.com)

Sofie: open-source web based system for automating live TV news production (nrkno.github.io)

RybbitL Open source Google Analytics replacement (github.com)

Intellect-2 Release: The First 32B Model Trained Through Globally Distributed RL

Comments (41)