Show HN: WhiteLightning – ultra-lightweight ONNX text classifiers trained w LLMs (whitelightning.ai)

   Even if you explicitly deny a guarantee of a certain behavior in your contract,
   if you usually deliver that behavior,
   most of your customers will depend on it.

Some examples:

If you make a queueing system, it's impossible to guarantee anything other than delivery "at most once" (some loss occurs), or "at least once" (some duplication occurs), but if you usually provide "exactly once" in practice, most of your customers will depend on this.

If you provide a data bucket service, and guarantee availability, but not performance, and you usually provide 100MB/s throughput, your customers will have major problems if you only provide 10MB/s throughput in some cases.

If you make a self-driving car, and it requires human monitoring, but it's really good, say one intervention per year of driving . . . your customers will die because they aren't paying attention.

rossant · 1h ago

If I'm not mistaken, CPython's dict preserved insertion order as an implementation detail at first, but because too many users came to rely on it, it was made part of the language specification starting in Python 3.7.

weinzierl · 4h ago

“Die normative Kraft des Faktischen” or “the normative force of the factual” is a thing and usually not seen as necessarily bad.

It recognizes that legitimacy often emerges organically from social acceptance rather than top-down imposition. In technology we often see that evolving reference implementations work better than elaborate specifications.

leoc · 1h ago

In its form as ‘the normalisation of deviance’ it’s generally recognised as bad.

Animats · 2h ago

> If you make a queueing system, it's impossible to guarantee anything other than delivery "at most once" (some loss occurs), or "at least once" (some duplication occurs), but if you usually provide "exactly once" in practice, most of your customers will depend on this.

That's only a condition at termination. For ongoing communication, you can guarantee exactly once delivery. When communication ceases, the final state between the ends is indeterminate. If you can keep talking, or resume after breaks, it's a solvable problem.

dongkyun · 5h ago

This is explicitly recognized in contract law: course of performance / dealing is a factor courts will consider in evaluating the nature of a deal. (Most contracts will try and carve it out).

chuckadams · 4h ago

Can't disagree with anything you said, though I think there are steps to address at least some of them: for queueing systems, testing with a chaos monkey isn't a bad idea... you'd want a test environment representative of production workloads, which is hard to do, but anything should be better than nothing.

In the self-driving car scenario, you'd probably go with cold statistics: is it killing fewer people than ones that need more interventions? Just like queueing though, experiments in production could be problematic.

worik · 2h ago

> In the self-driving car scenario, you'd probably go with cold statistics

No. There is a big difference in an accident caused by human error and an accident caused by machine failure.

We tolerate much more of the former than the latter.

This feels like a cognitive failure, but I do not think it is

doormatt · 1h ago

We ran into this at SNS (AWS) all the time.

kibwen · 16m ago

> This effect serves to constrain changes to the implementation, which must now conform to both the explicitly documented interface, as well as the implicit interface captured by usage.

Let's be clear that this is one interpretation of the phenomenon described here, which we might call "The Doomerist Interpretation of Hyrum's Law". For everyone else, the whole reason that we bother categorize interface details into "public" and "private" buckets is precisely so we have the moral high ground to to tell people to go kick rocks when they get they get uppity about their own failure to adhere to the publicly documented interface.

atakan_gurkan · 13m ago

This reminds me of the "illegal opcodes" in Commodore 64. I believe they were even present in their entirety in Commodore 128's (which uses 8502, not 6502 as its main CPU) C64 emulation mode. If someone knows how or why they remained part of 6502's instruction set over C64's lifetime, I really would like to know. I suspect, it was deliberate since "customers relied on them".

AdamH12113 · 7h ago

From an API designer's standpoint (especially if that API has paying customers), Hyrum's Law is something that has to be taken into account. But from a user's standpoint, it is engineering malpractice, plain and simple. At the very least, relying on quirks of someone else's implementation is a risk that should be understood and accounted for, and no one has any reasonable grounds for complaint if those quirks suddenly change in a new version.

gwd · 6h ago

> At the very least, relying on quirks of someone else's implementation is a risk that should be understood and accounted for, and no one has any reasonable grounds for complaint if those quirks suddenly change in a new version.

It's almost always unintentional. Someone wrote some code, it works, they ship it, not realizing it only works if the list comes back in a specific order, or with a specific timing. Then a year or two later they do some updates, the list comes back in a different order, or something is faster or slower, and suddenly what worked before doesn't work.

This is why in Golang, for instance, when you iterate over map keys, it purposely does it in a random order -- to make sure that your program doesn't accidentally begin to rely on the internal implementation of the hash function.

ETA: But of course, that's not truly random, just pseudorandom. It's not impossible that someone's code only works because of the particular pseudorandom order they're generating, and that if Golang even changes the pseudorandom number generator they're using to evade Hyrum's Law that someone's code will break.

RyanCavanaugh · 5h ago

There's probably at least one game out there somewhere that uses Go's map iteration order to shuffle a deck of cards, and would thus be broken by Go removing the thing that's supposed to prevent you from depending on implementation details.

deathanatos · 5h ago

Intent enters into it when someone complains about something that is obviously out of the specification breaking.

Prior that, yeah, that's just a bug.

> This is why in Golang, for instance, when you iterate over map keys, it purposely does it in a random order

It could be that Go's intentions are different here, but IIRC languages will mix randomization into hashtables as it is otherwise a security issue. (You typically know the hash function, usually, so without randomization you can force hash collisions & turn O(1) lookups into O(n).)

CobrastanJorji · 3h ago

A counterexample would be Python, where dictionaries maintain their insertion order.

deathanatos · 1h ago

Python does the same hash randomization, but yes, it also maintains the insertion order. This is more expensive, obviously, as additional data has to be tracked.

porridgeraisin · 1h ago

> mix randomisation into hash tables.

I believe you don't understand.

In go, they literally randomly permute the iteration order of the map each time you iterate over it.

e.g

  for x in map {
  
  }

  for x in map {
   // different order
  }

Now, the fact that they randomize means people use it as a cheap "select random item from map" function :D, which is hyrums law all over again.

  var randomUser User
  for userId, user in usersMap {
    randomUser = user
    break
  }

Funny isn't it.

deathanatos · 1h ago

Well … it seems like you're right. (A playground — https://go.dev/play/p/OHQTIuDWicd — if anyone is curious.)

That's … pretty surprising, since that would seem to imply that iteration would require a O(n) sized chunk of memory somewhere to reify the order into. (And probably O(n) time to do the shuffle, or at least, a copy of the ordering; we should shuffle as we go, I suppose, but we'd need to track what we've emitted & what we've not, hence O(n) time & space to populate that scratch…) That seems undesirable.

RyanCavanaugh · 6h ago

The problem is that people commonly don't even realize they're depending on implementation quirks.

For example, they write code that unintentionally depends on some distantly-invoked async tasks resolving in a certain order, and then the library implementation changes performance characteristics and the other order happens instead, and it creates a new bug in the application.

don-code · 6h ago

I don't think such usage is malicious, so much as ignorant - it's sometimes hard to know that a behavior _isn't_ part of the API, especially if the API is poorly documented to begin with.

I maintain a number of such poorly-documented systems (you could, loosely, call them "APIs") for internal customers. We've had a number of scenarios where we've found a bug, flagged it as a breaking change (which it is), said "there's _no way_ anybody's depending on that behavior", only to have one or two teams reach out and say yes, they are in fact depending on that behavior.

For that reason, we end up shipping many of those types of changes ship with a "bug flag". The default is to use the correct behavior; the flag changes the behavior to remain buggy, to keep the internal teams happy. It's then up to us to drive the users to change their ways, which.. doesn't always happen efficiently, let's say.

patrickmay · 7h ago

Exactly. Hyrum's Law should always be paired with Postel's Law: Be conservative in what you do, be liberal in what you accept from others.

chuckadams · 4h ago

Being liberal in what you accept also leads to users depending on you accepting marginal input that exploits implementation quirks, either because the quirks get the job done or for more nefarious reasons.

bigstrat2003 · 2h ago

Yeah, I definitely don't agree with the "be liberal in what you accept" paradigm. It's worth it to force users to send correctly-formed data, IMO.

twodave · 6h ago

Hard disagree. If my users are exploiting some unintended, unannounced part of my API then me patching that out is something they’re just going to have to deal with. In well-described systems these sorts of behaviors lead to nasty bugs down the line, sometimes months in the future (e.g. “Huh, why aren’t my tax reports tying out?”).

partdavid · 5h ago

I think you're agreeing with GP, not disagreeing.

twodave · 3h ago

I was disagreeing with the notion that this law has to be taken into account. I suppose that’s true for certain software, but if e.g. Apple can get away with breaking these use cases then I don’t see why, as an API designer, I should care either.

bryanrasmussen · 1h ago

I think this really depends on who your customers are and how they pay for your services.

throw0101c · 6h ago

> From an API designer's standpoint (especially if that API has paying customers), Hyrum's Law is something that has to be taken into account.

How good-of-an-idea / best practice is API versioning?

    /api/v1/foo
    /api/v2/foo

What are the pluses and minuses?

ad_hockey · 4h ago

A couple of considerations are:

- You have to decide whether to bump the entire API version or only the /foo endpoint. The former can be a big deal (and you don't want to do it often), the latter is messy. Especially if you end up with some endpoints on /v1 (you got it right first time) while others are on /v4 or /v5. Some clients like to hard-code the URL prefix of your API, including the version, as a constant.

- You still have to decide what your deprecation and removal policy will be. Does there come a time when you remove /api/v1/foo completely, breaking even the clients who are using it correctly, or will you support it forever?

It's not easy at all, especially if you have to comply with a backwards compatibility policy. I've had many debates about whether it's OK to introduce breaking changes if we consider them to be bug fixes. It depends on factors like whether either behaviour is documented and subjective calls on how "obviously unintended" the behaviour might be.

cogman10 · 3h ago

Plus, easy to see that you might have to do something different to move over to v2 as client.

Minus, You will support v1 forever. It's almost impossible to make it go away.

detaro · 4h ago

you end up with a lot of versions if you version everything that could change some non-guaranteed behavior in some corner case.

breppp · 7h ago

Depends on the product. Sometimes you are completely dependent on an API ecosystem (iOS, Android, Windows) where the only way to achieve something is a quirk

kccqzy · 7h ago

Malpractice? It's usually just plain old bugs unintentionally written that way.

Ygg2 · 7h ago

> But from a user's standpoint

Not true generally. One man's engineering malpractice is another man's clever hack.

Users of Windows 95 complained that Windows 95 broke SimCity.

What did Windows 95 break? It fixed an obscure allocator bug SimCity was relying on.

Users loved Windows 95, for ""fixing"" this. How was it fixed? By introducing an obscure switch to old allocator if it detected SimCity in the app name.

https://arstechnica.com/gadgets/2022/10/windows-95-went-the-...

bigstrat2003 · 2h ago

Different users. The users that GP was accusing of malpractice would be the Maxis devs in this case, not the end users who were trying to install SimCity on their Windows 95 machine.

Microsoft has a commitment to backwards compatibility that I think is going too far, but I understand why. Raymond Chen has explained that if a user buys the new version of Windows and their programs stop working, they will blame MS regardless because they don't have any way to know it's the program's fault. So MS is incentivized to go out of their way to enable these other programs' bad behavior, because it keeps their (Microsoft's) customers happy.

azhenley · 7h ago

Discussion last year: https://news.ycombinator.com/item?id=39401973

dang · 4h ago

Thanks! Macroexpanded:

Hyrum’s Law in Golang - https://news.ycombinator.com/item?id=42201892 - Nov 2024 (183 comments)

Hash Ordering and Hyrum's Law - https://news.ycombinator.com/item?id=41673295 - Sept 2024 (41 comments)

Hyrum's Law - https://news.ycombinator.com/item?id=39401973 - Feb 2024 (66 comments)

Git archive generation meets Hyrum's law - https://news.ycombinator.com/item?id=34631275 - Feb 2023 (76 comments)

Hyrum's Law - https://news.ycombinator.com/item?id=33283849 - Oct 2022 (52 comments)

Hyrum's Law - https://news.ycombinator.com/item?id=29848295 - Jan 2022 (36 comments)

Hyrum's Law - https://news.ycombinator.com/item?id=27386818 - June 2021 (5 comments)

Hyrum's Law: An Observation on Software Engineering - https://news.ycombinator.com/item?id=21515225 - Nov 2019 (6 comments)

Hyrum's Law - https://news.ycombinator.com/item?id=19249199 - Feb 2019 (1 comment)

cafard · 1h ago

Actually, I think that in The Mythical Man-Month Brooks mentioned users depending on nominally undefined, practically consistent behavior, e.g. what was left in some part of a register.

prats226 · 4h ago

This is super interesting to think about in LLM world where lot of software is getting replaced with LLM calls.

In terms of output of an LLM, there is no clear promise in the contract, only observable behaviour. Also the observable behaviour is subject to change with every update in LLM. So all the downstream systems have to have evals to counter this.

One good example is claude code where now people have started complaining them switching models effecting their downstream coding workflows.

worik · 1h ago

Yes.

This is the unfortunate thing about wrapping LLMs in API calls to provide services.

Unless you control the model absolutely (even then?) you can prompt the model with a well manicured prompt on Tuesday and get an answer - a block of text - and on Thursday, using the exact same prompt, get a different answer.

This is very hard to build good APIs around. If done expect rare corner case errors that cannot be fixed.

Or reproduced.

nvader · 6h ago

It seems to me that there's some advantages to undertaking "Freedom of Navigation Operations" by randomizing implementations from time to time to discourage any dependence on internal behaviors.

For instance, traversal order of maps in Go is always randomized, to prevent subtle bugs caused by depending on the order.

As AI generated code becomes cheaper, it may be worthwhile to change some subset of your internal behaviors from release to release, so that users don't become too complacent.

amelius · 5h ago

This is why an API should always have an

    "assertions": true

option. Why should normal function calls have assertion/invariant checks, and not API calls?

porridgeraisin · 1h ago

This idea looks good. Have you used it in practice? Can you share how?

amelius · 30m ago

Yes, you basically use the option whenever you have assertions turned on in your code.

Then the service running the API will do extra checking when the assertions option is true, basically making it less forgiving and following the specification closely.

fsmv · 4h ago

A good example of defence against this is go maps randomize iteration order just so that people don't rely on it being consistent.

kiitos · 1h ago

> if an interface has enough consumers, they will collectively depend on every aspect of the implementation ...

Yep!

> [and that] constrains changes to the implementation, which must now conform to both the explicitly documented interface, as well as the implicit interface captured by usage

Nope!

Software authors define the rules for the software that they author. I understand it's a spectrum and the rules are different in different circumstances but at the end of the day my API is what I say it is and if you rely on something that I don't guarantee that's on you and not me. Hyrum's Law describes a common pathology, it doesn't define an expected rule or requirement.

TehCorwiz · 5h ago

XKCD always has a relevant comic: https://xkcd.com/1172/

xatax · 2h ago

That's linked in the OP.

Show HN: Draw a fish and watch it swim with the others (drawafish.com)

Show HN: TraceRoot – Open-source agentic debugging for distributed services (github.com)

Show HN: An interactive dashboard to explore NYC rentals data (leaseswap.nyc)

Show HN: Pontoon – Open-source customer data syncs (github.com)

Show HN: Rewindtty – Record and replay terminal sessions as structured JSON (github.com)

Show HN: I made a website that makes you cry (cryonceaweek.com)

Show HN: Mcp-use – Connect any LLM to any MCP (github.com)

Show HN: AgentMail – Email infra for AI agents (chat.agentmail.to)

Show HN: KubeForge – A GUI for Kubernetes YAMLs (github.com)

Show HN: LinCal – A Calendar View for Linear Issues (lincal.app)

Show HN: Tambo – a tool for building generative UI React apps with tools/MCP (github.com)

Show HN: Sourcebot – Self-hosted Perplexity for your codebase (github.com)

Show HN:typed - Markdown app for writers, students, professionals, and creators (play.google.com)

Show HN: List of Clojure-like projects (github.com)

Show HN: PLATO5 – AI-guided social engine to turn strangers into IRL friends (plato5.us)

Show HN: IsAgent – Detect agents like ChatGPT Agent on your website (isagent.dev)

Show HN: Open-Source Alternative to Modal (github.com)

Show HN: Kanban-style Phase Board: plan → execute → verify → commit (traycer.ai)

Show HN: WhiteLightning – ultra-lightweight ONNX text classifiers trained w LLMs (whitelightning.ai)

Show HN: Walk-through of rocket landing optimization paper [pdf] (scpowers.github.io)

Show HN: Find paint colours in Ireland and generate your own palettes (swatcher.ie)

Show HN: An AI agent that learns your product and guides your users (frigade.ai)

Show HN: Astro dev blog template with interactive colorschemes (multiterm.stelclementine.com)

Show HN: Open-source alternative to ChatGPT Agents for browsing (github.com)

Show HN: Compress Image – Simple Lossless and Lossy Image Compression Tool (compressimagex.com)

Show HN: Square Images – Make Any Image a Perfect Square in One Click (squareimages.co)

Show HN: The easiest accessibility (a11y) checker for VSCode (github.com)

Show HN: Dlg – Zero-cost printf-style debugging for Go (github.com)

Show HN: Companies use AI to take your calls. I built AI to make them for you (pipervoice.com)

Show HN: Open-source physical rack-mounted GUI for home lab (getubo.com)

Show HN: A high-altitude low-power flight computer for high-altitude balloons (github.com)

Show HN: Zero Waste Cloud – Finds 20-40% savings in AWS/GCP bills and CO2 impact (zerowastecloud.io)

Show HN: The Aria Programming Language (github.com)

Show HN: AgentGuard – Auto-kill AI agents before they burn through your budget (github.com)

Show HN: Convert from MIDI file to ASCII tablature (and more) (github.com)

Show HN: Online Ruler – Measuring in inches/centimeters (anruler.com)

Show HN: SafeRate – AI chat-native mortgage lender (saferate.com)

Show HN: Cant – Library written in Rust that provides PyTorch-like functionality (github.com)

Show HN: A GitHub Action that quizzes you on a pull request (github.com)

Show HN: MoebiusXBIN – ASCII and text-mode art editor with custom font support (blog.glyphdrawing.club)

Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL (github.com)

Show HN: Demitter – Distributed Node.js Event Emitter (Pub/Sub) (github.com)

Show HN: Use Their ID – Use your local UK MP’s ID for the Online Safety Act (use-their-id.com)

Show HN: I built an AI that turns any book into a text adventure game (kathaaverse.com)

Show HN: CUDA Fractal Renderer (github.com)

Show HN: Publican – an HTML-first static site generator for Node.js (publican.dev)

Show HN: Implementation of DDPM (Denoising Diffusion Probabilistic Models) (github.com)

Show HN: Monchromate – Smart greyscale browser extension (monochromate.lirena.in)

Show HN: I made a tool to generate photomosaics with your pictures (pictiler.com)

Show HN: A tool for complete WebSocket traffic control (websocket-devtools.com)

Hyrum's Law

Comments (49)