Uncertain<T>

252 samtheprogram 57 8/28/2025, 5:22:54 PM nshipster.com ↗

Comments (57)

btown · 15m ago

Once one understands that a variable (in a programming context) can hold a specification for a variable (in a mathematical context), one opens up incredible doors that are at the foundation of modern AI.

When you see y = m * x + b, your recollections of math class may note that you can easily solve for "m" or find a regression for "m" and "b" given various data points. But from a programming perspective, if these are all literal values, all this is is a "render" function. How can you reverse an arbitrary render function?

There are various approaches, depending on how Bayesian you want to be, but they boil down to: if your language supports redefining operators based on the types of the variables, and you have your variables contain a full specification of the subgraphs of computations that lead to them... you can create systems that can simultaneously do "forward passes" by rendering the relationships, and "backward passes" where the system can automatically calculate a gradient/derivative and thus allow a training system to "nudge" the likeliest values of variables in the right direction. By sampling these outputs, in a mathematically sound way, you get the weights that form a model.

Every layer in a deep neural network is specified in this way. Because of the composability of these operations, systems like PyTorch can compile incredibly optimal instructions for any combination of layers you can think of, just by specifying the forward-pass relationships.

So Uncertain<T> is just the tip of the iceberg. I'd recommend that everyone experiment with the idea that a numeric variable might be defined by metadata about its potential values at any given time, and that you can manipulate that metadata as easily as adding `a + b` in your favorite programming language.
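For anyone who wants to try this, here is a minimal Python sketch of a variable that holds the subgraph of computations behind it. The `Var` class and its `backward` method are hypothetical names made up for illustration; this is not how PyTorch is implemented, only the same idea at toy scale, supporting just `+` and `*`.

```python
class Var:
    """A value that remembers how it was computed (hypothetical name)."""

    def __init__(self, value, parents=()):
        self.value = value
        self.parents = parents  # pairs of (parent Var, local derivative)
        self.grad = 0.0

    def __add__(self, other):
        other = other if isinstance(other, Var) else Var(other)
        return Var(self.value + other.value, ((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        other = other if isinstance(other, Var) else Var(other)
        return Var(self.value * other.value,
                   ((self, other.value), (other, self.value)))

    def backward(self, upstream=1.0):
        # Backward pass: accumulate d(output)/d(self) via the chain rule.
        self.grad += upstream
        for parent, local in self.parents:
            parent.backward(upstream * local)


m, x, b = Var(2.0), Var(3.0), Var(1.0)
y = m * x + b   # forward pass: "render" the relationship
y.backward()    # backward pass: gradients with respect to every input
print(y.value, m.grad, x.grad, b.grad)  # 7.0 3.0 2.0 1.0
```

The gradient on `m` is what a training loop would use to nudge `m` toward a better fit.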

AlotOfReading · 6h ago

A small note, but GPS is only well-approximated by a circular uncertainty in specific conditions, usually open sky and long-time fixes. The full uncertainty model is much more complicated, hence the profusion of ways to measure error. This becomes important in many of the same situations that would lead you to stop treating the fix as a point location in the first place. To give a concrete example, autonomous vehicles will encounter situations where localization uncertainty is dominated by non-circular multipath effects.

If you go down this road far enough you eventually end up reinventing particle filters and similar.

mikepurvis · 5h ago

Vehicle GPS is usually augmented by a lot of additional sensors and assumptions, notably the speedometer, compass, and knowledge that you'll be on one of the roads marked on its map. Not to mention a fast fix because you can assume you haven't changed position since you last powered on.

monocasa · 4h ago

As well as a fast fix because you know what mobile cell or wifi network you're on.

boscillator · 7h ago

Does this handle covariance between different variables? For example, the location of the object you're measuring your distance to presumably also has some error in its position, which may be correlated with your position (if, for example, it comes from another GPS operating at a similar time).

Certainly a univariate model in the type system could be useful, but it would be extra powerful (and more correct) if it could handle covariance.
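To put a number on why this matters, here is a small Python sketch using the standard identity Var(X - Y) = Var(X) + Var(Y) - 2 Cov(X, Y). It is reduced to one dimension, and the 5 m sigmas and the 0.9 correlation are made-up illustration values, not real GPS figures.

```python
import math

def std_of_difference(sigma_x, sigma_y, rho):
    """Standard deviation of X - Y given the correlation coefficient rho."""
    variance = sigma_x**2 + sigma_y**2 - 2 * rho * sigma_x * sigma_y
    return math.sqrt(max(variance, 0.0))  # clamp tiny negative rounding error

# Two position errors of 5 m each (hypothetical values).
independent = std_of_difference(5.0, 5.0, 0.0)  # about 7.07 m
correlated = std_of_difference(5.0, 5.0, 0.9)   # about 2.24 m
print(independent, correlated)
```

Treating the two fixes as independent would overstate the error in the distance between them by roughly a factor of three in this example.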

evanb · 5h ago

If you need to track covariance you might want to play with gvar https://gvar.readthedocs.io/en/latest/ in python.

layer8 · 6h ago

To properly model quantum mechanics, you’d have to associate a complex-valued wave function with any set of entangled variables you might have.

8note · 5h ago

for mechanical engineering drawings to communicate with machinists and the like, we use tolerances

eg. 10cm +8mm/-3mm

for what the acceptable range is, both bigger and smaller.

I'd expect something like "are we there yet" referencing GPS to understand the direction of the error and which directions of uncertainty are better or worse

mabster · 3h ago

Something that's bugged me about this notation though is that sometimes it means "cannot exceed the bounds" and sometimes it means "only exceeds the bounds 10% of the time"

taneq · 2h ago

I don’t think I’ve ever seen mechanical drawings have “90% confidence” dimensions like this. If a part’s too big then it won’t fit, and it’s probably useless.

kevin_thibedeau · 2h ago

If a test procedure is verifying all dimensional accuracy, it can be assumed to be bounding tolerance. If it's a mass production line with less than 100% testing of parts, you'd have to expect that some outliers get by and the tolerance is something like 3-sigma on a Gaussian.
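As a rough illustration of what a 3-sigma tolerance implies, assuming the dimension really is Gaussian and centered on the nominal value (real processes often are not), the fraction of parts outside the band can be computed with the error function. The helper name below is made up for this sketch.

```python
import math

def fraction_within(k):
    """Fraction of a Gaussian population within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))

inside = fraction_within(3)
outliers_per_million = (1 - inside) * 1_000_000
print(f"{inside:.4%} in tolerance, about {outliers_per_million:.0f} parts per million outside")
```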

j2kun · 5h ago

This concept has been done many times in the past, under the name "interval arithmetic." Boost has it [1], as does flint [2].

What is really curious is why, after being reinvented so many times, it is not more mainstream. I would love to talk to people who have tried using it in production and then decided it was a bad idea (if they exist).

[1]: https://www.boost.org/doc/libs/1_89_0/libs/numeric/interval/...

[2]: https://arblib.org/

kccqzy · 4h ago

The article says,

> Under the hood, Uncertain<T> models GPS uncertainty using a Rayleigh distribution.

And the Rayleigh distribution is clearly not just an interval with a uniformly random distribution in between. Normal interval arithmetic isn't useful because that uniform random distribution isn't at all a good model for the real world.

Take for example that Boost library you linked. Ask it to compute (-2,2)*(-2,2). It will give (-4,4). A more sensible result might be something like (-2.35, 2.35). The -4 lower bound is only attainable when you have -2 and 2 as the multiplicands which are at the extremes of the interval; probabilistically if we assume these are independent random variables then two of them achieving this extreme value simultaneously should have an even lower probability.
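A quick way to see the gap, sketched in plain Python rather than Boost: compute the worst-case interval product, then sample the product of two independent draws and take a central range. The uniform distribution and the 95% coverage are assumptions chosen only for illustration, so the sampled range need not match the figure quoted above.

```python
import random

def interval_mul(a, b):
    """Product of two closed intervals given as (low, high) tuples."""
    products = [x * y for x in a for y in b]
    return (min(products), max(products))

def sampled_mul(a, b, n=100_000, coverage=0.95, seed=0):
    """Central `coverage` range of x*y for independent uniform x and y."""
    rng = random.Random(seed)
    samples = sorted(rng.uniform(*a) * rng.uniform(*b) for _ in range(n))
    tail = int(n * (1 - coverage) / 2)  # samples trimmed from each end
    return (samples[tail], samples[n - 1 - tail])

bounds = interval_mul((-2, 2), (-2, 2))
typical = sampled_mul((-2, 2), (-2, 2))
print(bounds)   # (-4, 4)
print(typical)  # strictly inside the worst-case bounds
```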

woah · 1h ago

Using simple types (booleans etc.) is easy to reason about, and any shortcomings are obvious. Trying to model physical uncertainty is difficult and requires different models for different domains. Once you have committed to needing to do that, it would be much better to use a purpose-built model instead of a library that puts some bell curves behind a pretty API.

eru · 6m ago

I agree that different applications, strictly speaking, need different models of uncertainty.

But I'm not so sure about your conclusion: a good enough model could be universally useful. See how everyone uses IEEE 754 floats, despite them giving effectively one very specific model of uncertainty. Most of the time this just works, and sometimes people have to explicitly work around floats' weirdnesses (whether that's because they carefully planned ahead because they know what they are doing, or whether they got a nasty surprise first). But overall they are still useful enough to be used almost universally.

Tarean · 4h ago

Interval arithmetic is only a constant factor slower but may simplify at every step. For every operation over numbers there is a unique most precise equivalent op over intervals, because there's a Galois connection. But just because there is a most precise way to represent a set of numbers as an interval doesn't mean the representation is precise.

A computation graph which gets sampled like here is much slower but can be accurate. You don't need an abstract domain which loses precision at every step.
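The simplest case I know of where the abstract domain loses precision is `x - x`. This is a toy Python sketch with a made-up `interval_sub` helper, not the article's API: the interval version forgets that both operands are the same variable, while sampling the expression reuses one draw per evaluation.

```python
import random

def interval_sub(a, b):
    """Difference of two closed intervals given as (low, high) tuples."""
    return (a[0] - b[1], a[1] - b[0])

x = (0.0, 1.0)
widened = interval_sub(x, x)
print(widened)  # (-1.0, 1.0)

rng = random.Random(0)
samples = []
for _ in range(1000):
    draw = rng.uniform(*x)        # one draw of x per evaluation of the graph
    samples.append(draw - draw)   # both operands refer to the same draw
print(min(samples), max(samples))  # 0.0 0.0
```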

bee_rider · 4h ago

It would have been sort of interesting if we’d gone down the road of often using interval arithmetic. Constant factor slower, but also the operations are independent. So if it was the conventional way of handling non-integer numbers, I guess we’d have hardware acceleration by now to do it in parallel “for free.”

eru · 4m ago

You can probably get the parallelism for interval arithmetic today? Though it would probably require a bit of effort and not be completely free.

On the CPU you probably get implicit parallel execution with pipelines and re-ordering etc, and on the GPU you can set up something similar.

orlp · 1h ago

Not sure why this is being upvoted as the article is not describing interval arithmetic. It supports all kinds of uncertainty distributions.

PaulDavisThe1st · 2h ago

Several years ago when I discovered some of the historical work on interval arithmetic, I was astounded to find that there was a notable contingent in the 60s that was urging hardware developers to make interval arithmetic be the basic design of new CPUs, and saying quite forcefully that if we simply went with "normal" integers and floating point, we'd be unable to correctly model the world.

skissane · 48m ago

I think, as another commenter pointed out, interval arithmetic’s problem is that while it acknowledges the reality of uncertainty, its model of uncertainty is so simplistic that in many applications it is unusable. Making it the standard primitive could result in a situation where apps that don’t need to model uncertainty explicitly at all have to pay the price of being forced to do so, while apps that need a more realistic model of uncertainty are hamstrung by its interactions with another, overly simple model. It is one of those ideas that sounds great in theory, but there are good reasons it never succeeded in practice: the space of use cases where explicitly modelling uncertainty is desirable, but where the simplistic model of interval arithmetic is entirely adequate, is rather small. A standard primitive that only addresses the needs of a narrow subset of use cases is not a good architecture.

pklausler · 4h ago

Interval arithmetic makes good intuitive sense when the endpoints of the intervals can be represented exactly. Figuring out how to do that, however, is not obvious.

eru · 3m ago

Also not all uncertainties are modeled well by uniform distributions over an interval.

anal_reactor · 3h ago

Because reasoning about uncertain values / random variables / intervals / fuzzy logic / whatever is difficult and the model where things are certain is much easier to process while it models the reality well enough.

black_knight · 5h ago

This seems closely related to this classic Functional Pearl: https://web.engr.oregonstate.edu/~erwig/papers/PFP_JFP06.pdf

It’s so cool!

I always start my introductory course on Haskell with a demo of the Monty Hall problem with the probability monad and using rationals to get the exact probability of winning using the two strategies as a fraction.
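For readers without Haskell at hand, the same exact answer falls out of a plain enumeration with rationals. This Python sketch uses `fractions.Fraction` and a brute-force loop, not the probability monad from the paper, and the function name is made up.

```python
from fractions import Fraction
from itertools import product

def win_probability(switch):
    """Exact chance of winning, enumerating the car position and the first pick."""
    doors = range(3)
    total = Fraction(0)
    for car, pick in product(doors, doors):
        weight = Fraction(1, 9)  # car and pick are uniform and independent
        # The host always opens a goat door, so switching wins
        # exactly when the first pick was wrong.
        wins = (pick != car) if switch else (pick == car)
        if wins:
            total += weight
    return total

print(win_probability(switch=False))  # 1/3
print(win_probability(switch=True))   # 2/3
```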

contravariant · 3h ago

I feel like if you're worried about picking the right abstraction then this is almost certainly the wrong one.

layer8 · 6h ago

Arguably Uncertain should be the default, and you should have to annotate a type as certain T when you are really certain. ;)

nine_k · 5h ago

Only for physical measurements. For things like money, you should be pretty certain, often down to exact fractional cents.

It appears that a similar approach is implemented in some modern Fortran libraries.

rictic · 4h ago

A person might have mistyped a price, a barcode may have been misread, the unit prices might be correct but the quantity could be mistaken. Modeling uncertainty well isn't just about measurement error from sensors.

I wonder what it'd look like to propagate this kind of uncertainty around. You might want to check the user's input against a representative distribution to see if it's unusual and, depending on the cost of an error vs the friction of asking, double-check the input.

bee_rider · 1h ago

Typos seem like a different type of error from physical tolerances, and one that would be really hard to reason about mathematically.

XorNot · 4h ago

Money has the problem that no matter how clever you are someone will punch all the values into Excel and then complain they don't match.

Or specify they're paying X per day, but want hourly itemized billing...but it should definitely come out to X per day (this was one employer which meant I invoiced them with like 8 digits of precision due to how it divided, and they refused to accept a line item for mathematical uncertainty aggregates).

random3 · 4h ago

have you ever tried working computationally with money? Forget money, have you worked with floating points? There really isn't anything certain.

nine_k · 4h ago

Yes, I worked in a billing department. No, floats are emphatically not suitable for representing money, except the very rounded values in presentations.

Floats try to keep the relative error at bay, so their absolute precision varies greatly. You need to sum them starting with the smallest magnitude, and do many other subtle tricks, to limit rounding errors.
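A small Python demonstration of both points, with made-up values. It uses an explicit loop for the naive sum because recent Python versions make the built-in `sum` compensate for rounding on floats, which would hide the effect.

```python
import math
from decimal import Decimal

def left_to_right(xs):
    """Plain floating-point accumulation in the given order."""
    total = 0.0
    for x in xs:
        total += x
    return total

values = [1e16] + [1.0] * 10

naive = left_to_right(values)              # large value first: each 1.0 is rounded away
ascending = left_to_right(sorted(values))  # smallest magnitudes first
exact = math.fsum(values)                  # correctly rounded sum

print(naive == 1e16)                     # True
print(ascending == exact == 1e16 + 10)   # True

# For money, a decimal type sidesteps binary rounding.
print(0.1 + 0.2 == 0.3)                                       # False
print(Decimal("0.10") + Decimal("0.20") == Decimal("0.30"))   # True
```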

esafak · 6h ago

A complement to Optional.

lxe · 3h ago

I really like that this leans on computing probabilities instead of forcing everything into closed-form math or classical probability exercises. I’ve always found it way more intuitive to simulate, sample, and work directly with distributions. With a computer, it feels much more natural to uh... compute: you just run the process, look at the results, and reason from there.
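To make "just run the process and look" concrete, here is a toy Python analogue. The `Uncertain` class below is hypothetical and much simpler than the Swift library in the article: a value is a sampling function, `+` composes samplers, and a probability is estimated by counting samples.

```python
import random

class Uncertain:
    """A value defined by a sampling function (hypothetical Python analogue)."""

    def __init__(self, sampler):
        self.sampler = sampler  # callable taking a random.Random instance

    @staticmethod
    def normal(mean, stddev):
        return Uncertain(lambda rng: rng.gauss(mean, stddev))

    def __add__(self, other):
        # Operands are sampled separately, so they are treated as independent.
        return Uncertain(lambda rng: self.sampler(rng) + other.sampler(rng))

    def probability(self, predicate, n=10_000, seed=0):
        """Estimate P(predicate(value)) by Monte Carlo sampling."""
        rng = random.Random(seed)
        return sum(predicate(self.sampler(rng)) for _ in range(n)) / n

total = Uncertain.normal(10.0, 1.0) + Uncertain.normal(5.0, 2.0)
p = total.probability(lambda v: v > 15.0)
print(p)  # close to 0.5, since the sum is centered on 15
```

Because each operand is sampled separately, this sketch cannot represent the correlated inputs discussed elsewhere in the thread.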

cb321 · 5h ago

If you are in an even more "approximate" mindset (as opposed to propagating by simulation to get real-world, re-sampled, skewed distributions, as often happens in experimental physics labs, or at least their undergraduate courses), there is an error propagation simplification (https://en.wikipedia.org/wiki/Propagation_of_uncertainty) you can use for "small" errors. Then translating "root" errors to "downstream" errors is just simple chain-rule calculus. (There is a Nim library for that at https://github.com/SciNim/Measuremancer that I use at least every week or two, whenever I'm timing anything.)

It usually takes some "finesse" to get your data / measurements into territory where the errors are even small in the first place. So, I think it is probably better to do things like this Uncertain<T> for the kinds of long/fat/heavy tailed and oddly shaped distributions that occur in real world data { IF the expense doesn't get in your way some other way, that is, as per Senior Engineer in the article }.
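For the small-error case, the chain rule recipe fits in a few lines of Python. This is a generic first-order sketch assuming independent inputs and using a numerical derivative; it is not how Measuremancer is implemented, and the `propagate` helper and the example numbers are made up.

```python
import math

def propagate(f, values, sigmas, h=1e-6):
    """First-order standard deviation of f(*values) for independent inputs."""
    variance = 0.0
    for i, sigma in enumerate(sigmas):
        step = h * max(abs(values[i]), 1.0)
        up = list(values)
        up[i] += step
        down = list(values)
        down[i] -= step
        partial = (f(*up) - f(*down)) / (2 * step)  # central difference
        variance += (partial * sigma) ** 2          # add contributions in quadrature
    return math.sqrt(variance)

# Speed = distance / time, with small independent errors on both inputs.
d, t = 100.0, 9.58
speed_sigma = propagate(lambda dist, time: dist / time, [d, t], [0.01, 0.01])
print(speed_sigma)
```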

munchler · 6h ago

Is this essentially a programmatic version of fuzzy logic?

https://en.wikipedia.org/wiki/Fuzzy_logic

esafak · 6h ago

https://en.wikipedia.org/wiki/Probabilistic_programming more like. It is already a thing; see, for example, https://pyro.ai/

dcsommer · 2h ago

Seems more proper to call it a `ProbabilityDistribution` type. It's a more general and intuitive way to handle the concept.

bee_rider · 57m ago

But the pun, uncertainty.

ngruhn · 2h ago

Yeah but the shorter name wins

nicois · 4h ago

Is there a risk that this will underemphasise some values when the source of error is not independent? For example, the ROI on financial instruments may be inversely correlated to the risk of losing your job. If you associate errors with each, then combine them in a way which loses this relationship, there will be problems.

mackross · 7h ago

Always enjoy mattt’s work. Looks like a great library.

krukah · 6h ago

Monads are really undefeated. This particular application feels to me akin to wavefunction evolution? Density matrices as probability monads over Hilbert space, with unitary evolution as bind, measurement/collapse as pure/return. I guess everything just seems to rhyme under a category theory lens.

valcron1000 · 5h ago

Relevant (2006): https://web.engr.oregonstate.edu/~erwig/pfp/

keeganpoppen · 2h ago

oh man i had forgotten about this blog from when i orbited the swift ecosystem a bit... it's clearly as great as always! fun post!

droideqa · 5h ago

Could this be implemented in Rust or Clojure?

Does Anglican kind of do this?

lloydatkinson · 4h ago

Is there a complete C# implementation available for this? I looked over the original paper and it's just snippets.

kittoes · 2h ago

https://github.com/klipto/uncertainty

Pxtl · 1h ago

10 years since commit and no attached documents besides a tiny readme. Pass.

tricky_theclown · 4h ago

jakubmazanec · 7h ago

[flagged]

frizlab · 7h ago

> And why does it need to be part of the type system?

As presented in the article, it is indeed just a library.

muxl · 6h ago

It was chosen to be implemented as a generic type in this design because the way that uncertainty "pollutes" underlying values maps well onto monads which were expressed through generics in this case.

cobbal · 7h ago

I don't think inference is part of this at all, frequentist or otherwise.

It's not part of the type system, it's just the Giry monad as a library.

geocar · 6h ago

> What if I want Bayesian?

Bayes is mentioned on page 46.

> And why does it need to be part of the type system? It could be just a library.

It is a library that defines a type.

It is not a new type system, or an extension to any particularly complicated type system.

> Am I missing something?

Did you read it?

https://www.microsoft.com/en-us/research/wp-content/uploads/...

https://github.com/klipto/Uncertainty/

jakubmazanec · 6h ago

> Bayes is mentioned on page 46.

Bayes isn't mentioned in the linked article. But thanks for the links.
