Ask HN: Is GPU nondeterminism bad for AI?

2 points by ramity | 6/5/2025, 9:47:52 PM | 0 comments
Argument:

- GPUs use parallelism

- Floating point math is not associative

- Rounding error therefore accumulates differently depending on the order of operations (quick numeric illustration below)

- As a result, GPU computations carry run-to-run noise: the same training step can produce slightly different numbers each time

- There is a known tradeoff between noise in the data and achievable accuracy

- Noise requires overparameterization (a larger network) to generalize

- Overparameterization prevents the network from fully generalizing to the problem space

Therefore, GPU nondeterminism seems bad for AI. Where did I go wrong?
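
To make the floating point premise concrete, here's a quick numpy sketch I put together (the array sizes and reduction orders are arbitrary choices of mine): the same float32 values summed in different orders, the way different parallel reduction schedules would sum them, give slightly different totals.

    # Float addition is not associative, so the order a reduction runs in
    # changes the result slightly.
    import numpy as np

    print((0.1 + 0.2) + 0.3 == 0.1 + (0.2 + 0.3))  # False: classic non-associativity

    rng = np.random.default_rng(0)
    x = rng.standard_normal(100_000).astype(np.float32)

    total_a = x.sum(dtype=np.float32)            # one reduction order
    total_b = np.sort(x).sum(dtype=np.float32)   # a different order
    total_c = x.reshape(100, 1000).sum(axis=0, dtype=np.float32).sum(dtype=np.float32)  # a "tiled" order

    # The totals typically agree to only ~6-7 significant digits in float32.
    print(total_a, total_b, total_c)

On a GPU the order isn't just different, it can change from run to run (e.g. atomic adds completing in whatever order threads happen to finish), which is where the nondeterminism comes from.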

Questions:

- Has this been quantified? As I understand it, the answer would be situational and tied to details like network depth, width, architecture, learning rate, etc. At the end of the day, entropy means some sort of noise/accuracy tradeoff, but are we talking magnitudes like 10%, 1%, 0.1%? (A sketch of how I'd try to measure it follows after this list.)

- Because of the noise/accuracy tradeoff, it seems to follow that one could train a smaller network deterministically and achieve the same performance as a bigger network trained non-deterministically. Is this true, even if the difference is only a single neuron?

- If something like the problem space of driving a car is too large to be fully represented in a dataset (consider needing every atom in the universe as a hard drive), how can we be sure a dataset is a perfect sampling of the problem space?

- Wouldn't overparameterization guarantee the model learns the dataset rather than the problem space? Is it incorrect to conceptualize this as using a higher-degree polynomial to represent a lower-degree one? (Toy version below.)

- Even with perfect sampling, noisy computation still seems problematic when a small amount of noise can cause an avalanche. If this noise were somehow quantified at 1%, couldn't you say the dataset's "impression" left in the network would be 1% larger than it should be, maybe spilling over in a sense? Eval data points "very close to" but not included in the training data would then be more likely to incorrectly evaluate as the "nearby" training datapoint. Maybe I'm reinventing edge cases and overfitting here, but I don't think overfitting just spontaneously starts happening towards the end of training.
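
For the first question, here's a rough sketch of the experiment I have in mind for measuring it, assuming PyTorch; the toy MLP, the synthetic data, and names like run_once / N_RUNS are all made up for illustration. Train the identical setup several times with deterministic algorithms forced, then several times without, and compare the spread of the final metric:

    # Measure run-to-run spread attributable to kernel nondeterminism.
    # Note: on a tiny dense model like this the spread may well be ~0, since
    # the usual culprits are atomics in ops like scatter/index_add and some
    # cuDNN backward kernels -- the measurement procedure is the point.
    import os
    os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":4096:8")  # needed for deterministic cuBLAS

    import torch
    import torch.nn as nn

    N_RUNS = 5

    def run_once(deterministic: bool, seed: int = 0) -> float:
        torch.manual_seed(seed)                            # identical init every run
        torch.use_deterministic_algorithms(deterministic)  # force (or allow) nondeterministic kernels
        device = "cuda" if torch.cuda.is_available() else "cpu"

        g = torch.Generator().manual_seed(123)             # identical data every run
        x = torch.randn(1024, 32, generator=g).to(device)
        y = torch.randn(1024, 1, generator=g).to(device)

        model = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 1)).to(device)
        opt = torch.optim.SGD(model.parameters(), lr=0.01)
        loss_fn = nn.MSELoss()

        for _ in range(200):
            opt.zero_grad()
            loss = loss_fn(model(x), y)
            loss.backward()
            opt.step()
        return loss.item()

    for det in (True, False):
        losses = [run_once(det) for _ in range(N_RUNS)]
        print(f"deterministic={det}: spread={max(losses) - min(losses):.3e}, losses={losses}")

Fixing the seed in both arms is deliberate: any remaining spread should then be attributable to kernel-level nondeterminism rather than to initialization or data order.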
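
And here's the toy version of the higher-degree-polynomial analogy from the fourth question (the cubic "true" function, the degrees, and the sample count are arbitrary choices of mine): the model with more free parameters drives training error down by chasing the noise in the sampled points, at the cost of error everywhere in between.

    # Fit noisy samples of a cubic with a degree-3 and a degree-10 polynomial,
    # then compare error on the training points vs. on a dense grid standing
    # in for the full problem space.
    import numpy as np

    rng = np.random.default_rng(0)

    def true_f(t):
        # Underlying function, standing in for the problem space.
        return t**3 - t

    x_train = np.linspace(-1, 1, 15)
    y_train = true_f(x_train) + 0.1 * rng.standard_normal(x_train.size)  # the noisy "dataset"
    x_dense = np.linspace(-1, 1, 500)

    for degree in (3, 10):
        coeffs = np.polyfit(x_train, y_train, degree)
        train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
        space_mse = np.mean((np.polyval(coeffs, x_dense) - true_f(x_dense)) ** 2)
        print(f"degree={degree}: train MSE={train_mse:.4f}, problem-space MSE={space_mse:.4f}")

    # Typically: degree 10 gets the lower train MSE (it learned the dataset) but
    # the higher problem-space MSE (it did not learn the underlying function).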
