Ask HN: Residue Number Systems for GPU computing as indie-researcher. Thoughts?

1 muragekibicho 0 5/19/2025, 5:26:19 PM
I've been wondering: "Are there analogs to parallel computing rooted in number theory? Like a way to emulate a GPU on a regular CPU, not through hardware, but by replacing GPU threads with concepts from prime numbers and finite field theory?" I know this sounds kooky.

However, I care about this question because, in a world where AI is becoming commonplace, being GPU-poor is somewhat akin to being locked out of the future. There must be some way to perform lots of matmuls (or at least an intelligence-evoking amount of muls) on consumer CPUs. Maybe we just haven't invented the right number system. I believe math is mostly invented, rarely discovered: binary 1s and 0s are surely discovered, but floating-point and fixed-point are human inventions tweaked for specific use cases.

In fact, researchers built Residue Number Systems (RNS) in the 1960s as an alternative to binary base-2 arithmetic for massively parallel computing. However, they fell out of favor because finite field theory (the foundational math behind RNS) supports addition and multiplication, but neither division nor comparison. Here's the thing: matrix multiplication is just multiplies and adds. RNS is great for general matmuls but fails at operations like softmax and backprop.
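To make the parallelism concrete, here is a minimal RNS sketch (my own illustration, not from any of the linked posts): an integer below the product of pairwise-coprime moduli is stored as its residues, and addition/multiplication happen independently per residue channel.

```python
from math import prod

MODULI = (3, 5, 7)            # pairwise coprime; dynamic range M = 105
M = prod(MODULI)

def to_rns(x):
    # Represent x by its residues modulo each channel's modulus.
    return tuple(x % m for m in MODULI)

def rns_add(a, b):
    # Each channel adds independently -- no carries cross channels.
    return tuple((x + y) % m for x, y, m in zip(a, b, MODULI))

def rns_mul(a, b):
    # Likewise for multiplication: fully parallel across channels.
    return tuple((x * y) % m for x, y, m in zip(a, b, MODULI))

def from_rns(r):
    # Chinese Remainder Theorem reconstruction (Python 3.8+ for pow(..., -1, m)).
    x = 0
    for ri, mi in zip(r, MODULI):
        Mi = M // mi
        x += ri * Mi * pow(Mi, -1, mi)
    return x % M

a, b = to_rns(17), to_rns(4)
assert from_rns(rns_add(a, b)) == 21    # 17 + 4
assert from_rns(rns_mul(a, b)) == 68    # 17 * 4
```

Note there is no `rns_div` or `rns_less_than`: comparing two numbers requires looking across all channels at once, which is exactly the property that breaks the channel-local parallelism.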

I argue that solving the division and comparison problems for RNS is the key to solving GPU poverty. RNS's foundational research was done in the 1960s. What if math invented in the 21st century is needed to solve this? For context, only after the invention of new mathematics in the 20th century did Wiles prove Fermat's Last Theorem.

I know "talk is cheap" and that I need to "be the change I want to see in the world".

So here's everything I attempted with the RNS problem and how it failed. *Please be lenient with criticism: I'm not VC-funded, so I do this during my nights and weekends after coming home from my day job.

1. Finite Field Gemm (Attempting to rewrite backpropagation within a residue number system)

It works in principle, but in practice it involves big-integer multiplications and additions, so during the backward pass the deltas become rather large, making it impossible to converge to a local minimum. Link: https://leetarxiv.substack.com/p/ffgemm-finite-field-general-matrix
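The matmul-friendly half of the story can be sketched like this (my own toy version, not the FFGEMM code): run the same integer matmul once per modulus, each in small residues, then reconstruct exact entries with the CRT as long as they stay below the product of the moduli.

```python
from math import prod

MODULI = (251, 241, 239)        # pairwise coprime; range ~1.4e7

M = prod(MODULI)

def matmul_mod(A, B, m):
    # Ordinary integer matmul with everything reduced mod m.
    n, k, p = len(A), len(B), len(B[0])
    return [[sum(A[i][t] * B[t][j] for t in range(k)) % m
             for j in range(p)] for i in range(n)]

def crt(residues):
    # Recover the exact entry (< M) from its per-channel residues.
    x = 0
    for r, m in zip(residues, MODULI):
        Mi = M // m
        x += r * Mi * pow(Mi, -1, m)
    return x % M

A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]

# One independent small-integer matmul per modulus -- these are the
# "threads"; each could run on a separate core with no communication.
channels = [matmul_mod(A, B, m) for m in MODULI]

C = [[crt([ch[i][j] for ch in channels]) for j in range(2)]
     for i in range(2)]
assert C == [[19, 22], [43, 50]]   # matches the plain integer matmul
```

The failure mode the post describes shows up as soon as you need anything beyond add/mul: there is no cheap way to clip, normalize, or compare the per-channel accumulators mid-computation, so gradient magnitudes can only be inspected after a full (expensive) reconstruction.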

2. Mediant32 (A fractional number system to get rid of explicit division in RNS)

We can't divide efficiently in an RNS, so why not carry an explicit numerator and denominator? It works extremely well; honestly, I was impressed. The only difficulty is handling overflow: the fractions tend to become quite large, and the absence of division makes simplification difficult. Link: https://leetarxiv.substack.com/p/mediant32-intro

3. Discrete Logarithm Neural Networks (A network architecture built around big integers, since my biggest challenge was dealing with big ints)

It actually works. I was surprised it reached 74% accuracy on the Iris dataset. The problem is that training the network amounts to solving a discrete logarithm problem, so the weights can only be found by brute force. I don't think I'll find a way to backpropagate; if I did, I would achieve the impossible: breaking modern-day cryptography. Link: https://leetarxiv.substack.com/p/discrete-logarithm-neural-networks
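The overflow problem in item 2 is easy to demonstrate with plain (numerator, denominator) pairs; Mediant32's actual encoding differs, so treat this only as an illustration of why the terms blow up when you can't reduce:

```python
# Fractions stored as (numerator, denominator) integer pairs.
# Addition and multiplication need no division...
def f_add(a, b):
    return (a[0] * b[1] + b[0] * a[1], a[1] * b[1])

def f_mul(a, b):
    return (a[0] * b[0], a[1] * b[1])

x = (1, 3)
for _ in range(10):
    x = f_add(x, (1, 3))         # x is 11/3 after the loop

assert 3 * x[0] == 11 * x[1]     # the value is exactly right...
assert x[1] == 3 ** 11           # ...but the denominator is 177147
```

Reducing the pair needs a gcd, and Euclid's algorithm needs exactly the division and comparison that RNS lacks, so the growth is hard to contain in-system.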

Next steps: I'm looking at the Factorial Number System. The modulo function is pretty well defined, and it supports floating-point and reals, so I don't have to worry about the RNS division problem. I bet I can define an RNS using factorials, and this will be easy to work with. I started by going down the Enumerative Combinatorics rabbit hole here: [What Every Programmer Should Know About Enumerative Combinatorics](https://leetarxiv.substack.com/p/counting-integer-compositions?r=2at73k)

Let me know what you guys think.
