VaultGemma: The most capable differentially private LLM

36 points by meetpateltech on 9/12/2025, 4:14:50 PM (research.google)

Comments (7)

Workaccount2 · 52m ago
If I am understanding this correctly, this is pretty damn cool. I've only put about 15 minutes of research into it, but there's no better way to get corrected than being wrong on the internet.

Essentially it seems that they can use statistical magic to "fuzz" the training in such a way that it becomes very difficult for the model to leak information from the training set, while still providing the same output whether or not a given piece of info was in the training set. So I suppose the goal would be something like the ability to train on medical data, while making it so the model won't be able to complete the prompt "Workaccount 2 has a serious medical condition called ______" and would give the same response regardless of whether or not I was present in the database.
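Roughly what that "fuzzing" looks like mechanically, if I have it right: the noise is typically added to the gradients during training (DP-SGD), not to the dataset itself. A minimal sketch, with illustrative hyperparameters that are assumptions here, not VaultGemma's actual values:

```python
# Minimal sketch of one DP-SGD step, the standard mechanism behind DP training.
import numpy as np

def dp_sgd_step(per_example_grads, clip_norm=1.0, noise_multiplier=1.0):
    # 1. Clip each example's gradient so no single example can move the
    #    model by more than clip_norm.
    clipped = [g * min(1.0, clip_norm / (np.linalg.norm(g) + 1e-12))
               for g in per_example_grads]
    # 2. Average, then add Gaussian noise calibrated to that bound, so the
    #    update looks (almost) the same with or without any one example.
    mean_grad = np.mean(clipped, axis=0)
    sigma = noise_multiplier * clip_norm / len(per_example_grads)
    return mean_grad + np.random.normal(0.0, sigma, size=mean_grad.shape)
```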

HenryMulligan · 10m ago
Ignoring what this model architecture could do and just considering what this model does do, why would I (or anyone) want to run this model (locally) to do <insert use-case>? Is it entirely a proof-of-concept for future training on medical data? Are they looking to use this to attempt to ethically justify training on (free-tier) users' personal data via the application of noise to the training data?
floridianfisher · 8m ago
The purpose is research
diggan · 38m ago
The actual weights: https://huggingface.co/google/vaultgemma-1b

> VaultGemma is a variant of the Gemma family of lightweight, state-of-the-art open models from Google. It is pre-trained from the ground up using Differential Privacy (DP). This provides strong, mathematically-backed privacy guarantees for its training data, limiting the extent to which the model's outputs can reveal information about any single training example.

> VaultGemma was trained using Tensor Processing Unit (TPU) hardware TPUv6e. Training large language models with the significant computational overhead of differential privacy requires specialized hardware. TPUs are designed to handle the massive computations involved, offering the performance, memory, and scalability necessary to train models like VaultGemma efficiently and sustainably.

Seems like it requires TPUs to run, as DP has a huge performance impact, so we're unlikely to see this in homelabs and similar environments, as far as I understand.

Edit: On second read, the TPUs were only used for training, and there's no mention of any special hardware being needed for inference, so I'm assuming it's fine with a regular GPU?
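If that's right, loading it should look like any other Gemma-family checkpoint. A minimal sketch using the standard transformers causal-LM API (I haven't verified this against the model card, so treat the exact usage as an assumption):

```python
# Minimal sketch: running VaultGemma locally via Hugging Face transformers.
# Assumes the standard Gemma-style causal-LM interface applies; check the
# model card for the officially recommended usage.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/vaultgemma-1b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("Differential privacy is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```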

ForHackernews · 1h ago
Can someone explain what this actually means? I assume this still runs on Google's cloud, so it's not 'private' in any meaningful sense.
stephantul · 47m ago
It does not run on Google’s cloud. You can download the model and host it yourself, locally or using a provider you trust.
porridgeraisin · 8m ago
Differentially private means that:

training_algorithm(training data with a row that has "ForHackernews blood test report...") is hard to distinguish from training_algorithm(training data without that row), up to a factor of e^epsilon. They explain further in the article itself, with concrete values for epsilon.
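The textbook statement of that guarantee, for reference (the standard definition also allows a small additive slack delta):

```latex
% (epsilon, delta)-differential privacy: for a training algorithm M, any two
% datasets D and D' differing in a single row, and any set of outputs S:
\Pr[M(D) \in S] \;\le\; e^{\varepsilon} \, \Pr[M(D') \in S] + \delta
```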