Show HN: I built a system to make ChatGPT brutally honest with you (honestprompts.com)

The emphasis is indeed on "without trust" – as far as I can tell this project is unable to verify whether the decentralized training nodes are contributing productively.

Without the ability to validate that training compute is heading in the globally desired direction, it is unlikely you could use it as the foundation of a (sound) cryptocurrency.

proof_by_vibes · 1h ago

There could be merit to this. Proofs are generally computationally hard, so it's possible that a currency could be created by quantifying verification.

littlestymaar · 1h ago

> To stop wasting computing resources in crypto currencies and get something useful as a byproduct.

Bitcoin is the only major cryptocurrency that still use proof of work today (others are either using “proof of stakes” or are “Layer 2” chains), and due to its (relative lack of) governance structure, it's very unlikely to ever change.

3abiton · 2h ago

This is rather exciting! I see the future of Co-op models made by a community of experts on a specific field that would still allow them to be competitive with "AI monopolies". Maybe not all hope is lost!

schneehertz · 3h ago

I used to have an idea related to science fiction novels that artificial intelligence could aggregate computing power through the network to perform ultra-large-scale calculations, thereby achieving strong artificial intelligence. Reality will also develop in this way, which is very interesting

danielhanchen · 2h ago

I made some GGUFs at https://huggingface.co/unsloth/INTELLECT-2-GGUF

./llama.cpp/llama-cli -hf unsloth/INTELLECT-2-GGUF:Q4_K_XL -ngl 99

Also it's best to read https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-e... on sampling issues for QwQ based models.

Or TLDR, use the below settings:

./llama.cpp/llama-cli -hf unsloth/INTELLECT-2-GGUF:Q4_K_XL -ngl 99 --temp 0.6 --repeat-penalty 1.1 --dry-multiplier 0.5 --min-p 0.00 --top-k 40 --top-p 0.95 --samplers "top_k;top_p;min_p;temperature;dry;typ_p;xtc"

jumploops · 4h ago

Congrats to the team on the launch!

Personal story time: I met a couple of their engineers at an event a few months back. They mentioned they were building a distributed training system for LLMs.

I asked them how they were building it and they mentioned Python. I said something along the lines of “not to be the typical internet commenter guy, but why aren’t you using something like Rust for the distributed system parts?”

They mumbled something about Python as the base for all current LLMs, and then kinda just walked away…

From their article: > “Rust-based orchestrator and discovery service coordinate permissionless workers”

Glad to see that I wasn’t entirely off-base :)

No comments yet

refulgentis · 4h ago

I guess I'm bearish?

It's not that they trained a new model, but they took an existing model and RL'd it a bit?

The scores are very close to QwQ-32B, and at the end:

"Overall, as QwQ-32B was already extensively trained with RL, it was difficult to obtain huge amounts of generalized improvement on benchmarks beyond our improvements on the training dataset. To see stronger improvements, it is likely that better base models such as the now available Qwen3, or higher quality datasets and RL environments are needed."

fabmilo · 3h ago

The interesting delta here is that this proves that we can distribute the training and get a functioning model. The scaling factor is way bigger than datacenters

comex · 2h ago

But does that mean much when the training that produced the original model was not distributed?

refulgentis · 2h ago

The RL, not the training. No?

christianqchung · 3h ago

Third party fine tuned open weighted LLMs tend to be good at a handful of benchmarks, but parity or lower on others compared to the original model. There are some exceptions like Nvidia's Nemotron series, but the differences generally are so small as to be imperceptible. Deepseek released finetunes of several Qwen and Llama models alongside R1, and while they were better in some select (mostly math) and coding domains, there were problems resulting from fine tuning that didn't result in them overtaking the original models in usage.

mountainriver · 4h ago

Awesome work this team is doing. Globally distributed MoE could have real legs

esafak · 4h ago

How are they ensuring robustness against adversarial responses?

nsingh2 · 4h ago

From the article, seems like TOPLOC:

> based on top of novel components such as TOPLOC, which verifies rollouts from untrusted inference workers

https://github.com/PrimeIntellect-ai/toploc

xmasotto · 36m ago

Can an expert explain how this protects against adversarial actors?

At a glance it looks like something akin to a computing a checksum that's locality sensitive, so it's robust to floating point errors, etc.

What's to stop someone from sending bad data + a matching bad checksum?

ndgold · 4h ago

Pretty badass

quantumwoke · 4h ago

Wonder what the privacy story is like. Enterprises don't usually like broadcasting their private data across a freely accessible network.

bjt12345 · 3h ago

A strong use case here for quantum-safe encryption.

Show HN: I built a system to make ChatGPT brutally honest with you (honestprompts.com)

Fundings for NFT (indiegogo.com)

How to pretend to work 40 hours a week (miserablyemployed.com)

Colmap – Structure-from-Motion and Multi-View Stereo (colmap.github.io)

SurrealQL (surrealdb.com)

Turkey: PKK Announces Intention to Disband (dw.com)

Discovery by Norfolk metal detectorist baffles experts (edp24.co.uk)

DeerFlow, a community-driven Deep Research framework (github.com)

US AI execs urge improved infrastructure and chip exports to beat China (reuters.com)

Toolkami (github.com)

Microsoft Unveils SurfaceLaptop13 and SurfacePro12 with AI Snapdragon X Plus 4nm (reuters.com)

Values of β will give rise to DOM (9p.io)

The perfect game engine for indie game developers

Writing at the Speed of Thought [video] (youtube.com)

SpaceRadiationTolerant Framework Achieves 97% Uptime in Extreme Space Conditions (github.com)

EuroBillTracker – Follow your Euro notes (en.eurobilltracker.com)

Why Gen X is the real loser generation (economist.com)

Nvidia's RTX Pro 5000 Specs – Here's What Stands Out for Local LLM Work (hardware-corner.net)

Explaining the Failures of Obesity Therapy (nature.com)

You: "should I write a book?" (twitter.com)

Flakes Have Failed (kilo.bytesize.xyz)

Highlights from the Comments on AI GeoGuessr (astralcodexten.com)

Babies Are Born in Blood and Chaos (world.hey.com)

Microsoft silently fixes Windows 10 Start Menu jump list bug (bleepingcomputer.com)

Stop Saying "Responsible Disclosure" (da.vidbuchanan.co.uk)

Winning Cluedo (bitsandtheorems.com)

Top New Artificial Intelligence Innovations in 2025 (wilnickmagazine.com)

Show HN: BizzRev – Personalized Business and Tech News Feed (apps.apple.com)

Why Genghis Khan's Tomb Has Never Been Discovered (utubepublisher.in)

Unreleased RTX Titan Ada prototype showcased, 48GB VRAM, dual 16-pins (tomshardware.com)

Show HN: Blog comments, nice looking, open source – Talkyard (blog-comments.talkyard.io)

Fosstodon Community Statement – Cleaning house, owning past mistakes (hub.fosstodon.org)

Norway hands over Arctic Council intact after 'difficult' term as chair (theguardian.com)

Replacing tmux and GNU screen with Emacs (masteringemacs.org)

Terence Tao: Formalizing a proof in Lean using GitHub Copilot and canonical (youtube.com)

Show HN: Clean, high-quality AI image generation (shutterly.co)

The Disaster Cycle [video] (youtube.com)

Little Language Lessons – Google Labs (labs.google)

Trump To Sign EO Aimed at Lowering Drug Prices (wsj.com)

Ask HN: Zer0 Browser – A Fast, Private Browser with Zero Bloat?

Netcetera used Clojure+Rama to 100x a product used by millions (blog.redplanetlabs.com)

India's Perfumers Recreate the Smell of Rain on Earth [video] (youtube.com)

I Don't Have Spotify (idonthavespotify.donado.co)

Trump to sign executive order to cut prices of medicine to match other countries (reuters.com)

Ask HN: Pipelines with WASM Components

Ocamlfind will not build on OS X Catalina if CLICOLOR=1 (github.com)

Show HN: Nashville Lyric and Chord Chart Formatter (git.sr.ht)

About Green Screens and mouse-clickable UIs (try-as400.pocnet.net)

Property Division Calculator – A California Divorce App (ca-divorce.streamlit.app)

Ask HN: Cursor or Windsurf?

Intellect-2 Release: The First 32B Model Trained Through Globally Distributed RL

Comments (23)