"This telegram must be closely paraphrased before being communicated" Why? (history.stackexchange.com)

We had a critical service that often got overwhelmed, not by one client app but by different apps over time. One week it was app A, the next week app B, each with its own buggy code suddenly spamming the service.

The quick fix suggested was caching, since a lot of requests were for the same query. But after debating, we went with rate limiting instead. Our reasoning: caching would just hide the bad behavior and keep the broken clients alive, only for them to cause failures in other downstream systems later. By rate limiting, we stopped abusive patterns across all apps and forced bugs to surface. In fact, we discovered multiple issues in different apps this way.

Takeaway: caching is good, but it is not a replacement for fixing buggy code or misuse. Sometimes the better fix is to protect the service and let the bugs show up where they belong.
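
To make the rate-limiting approach above concrete, here is a minimal sketch of a per-client token bucket. It is not the implementation described in the comment. The class name `TokenBucket`, the parameters `rate` and `capacity`, and the injectable `clock` are all assumptions made for illustration.

```python
import time


class TokenBucket:
    """Per-client token bucket (illustrative sketch, hypothetical names).

    Each client may burst up to `capacity` requests and then
    regains tokens at `rate` tokens per second.
    """

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock
        # client_id -> (tokens remaining, time of last update)
        self.state = {}

    def allow(self, client_id):
        now = self.clock()
        # A client seen for the first time starts with a full bucket.
        tokens, last = self.state.get(client_id, (self.capacity, now))
        # Refill in proportion to elapsed time, capped at capacity.
        tokens = min(self.capacity, tokens + (now - last) * self.rate)
        if tokens >= 1:
            self.state[client_id] = (tokens - 1, now)
            return True
        # Rejected: the misbehaving client sees the failure itself.
        self.state[client_id] = (tokens, now)
        return False
```

Because the buckets are keyed by client, a buggy app A exhausts only its own budget while app B keeps working, which is what lets the bug show up in the client that caused it.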

andersmurphy · 36m ago

I guess CPUs are pretty buggy with all their caches. If only the hardware people could fix their buggy systems.

In all seriousness sometimes a cache is what you need. Inline caching is a classic example.

zeras · 2h ago

I think a fundamental mistake I see many developers make is they use caching trying to solve problems rather than improve efficiency.

It's the equivalent of adding more RAM to fix poor memory management or adding more CPUs/servers to compensate for resource heavy and slow requests and complex queries.

If your application requires caching to function effectively then you have a core issue that needs to be resolved, and if you don't address that issue then caching will become the problem eventually as your application grows more complex and active.

chamomeal · 2h ago

Idk I think caching is a crucial part of many well-designed systems. There’s a lot of very cache-able data out there. If invalidating events are well defined or the data is fine being stale (week/month level dashboards, for example), that’s a fantastic reason to use a cache. I’d much rather just stuff those values in a cache than figure out any other more complicated solution.

I also just think it’s a necessary evil of big systems. Sometimes you need derived data. You can even think about databases as a kind of cache: the “real” data is the stream of every event that ever updated data in the database! (Yes this is stretching the meaning of cache lol)

However I agree that caching is often an easy bandaid for a bad architecture.

This talk on Apache Samza completely changed how I think about caching and derived data in general: https://youtu.be/fU9hR3kiOK0?si=t9IhfPtCsSyszscf

And this interview has some interesting insights on the problems that caching faces at super large scale systems (twitter specifically): https://softwareengineeringdaily.com/2023/01/12/caching-at-t...

hinkley · 1h ago

There are a lot of things necessary to be a successful human but doing them without doing the fundamentals just makes you a monkey in a suit.

Caching belongs at the end of a long development arc. And it will be the end whether you want it to or not. Adding caching is the beginning of the end of large architectural improvements, because caches jam up the analysis and testing infrastructure. Everything about improving or adding features to the code slows down, eventually to a crawl.

hinkley · 1h ago

> It's the equivalent of adding more RAM to fix poor memory management

No it’s ten times worse than that. Adding RAM doesn’t make the task of fixing the memory management problems intrinsically harder. It just makes the problem bigger when you do fix it.

Adding caching to your app makes all of the tools used for detecting and categorizing performance issues much harder to use. We already have too many developers and “engineers” who balk at learning more than the basics of using these tools. Caching is like stirring up sediment in a submarine cave. Now only the most disciplined can still function and often just barely.

When you don’t have caches, data has to flow along the call tree. So if you need a user’s data in three places, that data either flows to those three or you have to look it up three times, which can introduce concurrency issues if the user metadata changes in the middle of a request. But because it’s inefficient there is clear incentive to fix the data propagation issues. Fixing those issues will make testing easier because now the data is passed in instead of having to mock the lookup code.
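
A minimal sketch of that propagation style, assuming a hypothetical `User` record and three made-up consumers (`greeting`, `quota`, `audit_line`): the user is looked up once per request and passed down the call tree.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class User:
    # Hypothetical user record, for illustration only.
    id: int
    name: str
    plan: str


def greeting(user):
    return f"Hello, {user.name}"


def quota(user):
    return 100 if user.plan == "pro" else 10


def audit_line(user):
    return f"user={user.id} plan={user.plan}"


def handle_request(user_id, load_user):
    # One lookup per request. All three consumers see the same
    # snapshot, so a mid-request change cannot split their view.
    user = load_user(user_id)
    return {
        "greeting": greeting(user),
        "quota": quota(user),
        "audit": audit_line(user),
    }
```

Each consumer can be tested by constructing a `User` directly, with no need to mock the lookup code.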

Then you introduce caching. Now the incentive is mostly gone, since you will only improve cold start performance. And now there is a perverse incentive to never propagate the data again. You start moving backward. Soon there are eight places in the code that use that data, because looking it up was “free” and they are all detached from each other. And now you can’t even turn off the cache, and cache traffic doesn’t tell you what your costs are.

And because the lookup is “free” the user lookup code disappears from your perf data and flame graphs. Only a madman like me will still tackle such a mess, and even I have difficulty finding the motivation.

For these reasons I say with great confidence and no small authority: adding caching to your app is the last major performance improvement most teams will ever see. So if you reach for it prematurely, you’re stuck with what you’ve got. Now a more astute competitor can deliver a product that is faster, cheaper, or both and eat your lunch, and your team will swear there is nothing they can do about it because the app is already as fast as they can make it, and here are the statistics that “prove” it.

Friends don’t let friends put caches on immature apps.

lemmsjid · 52m ago

I’d say a useful way of thinking about caching is through the lens of the CAP theorem. You are facing a situation where compute requirements exceed the bounds of a single process. There are a variety of things you can do here, all with consequences to the Consistency aspect of your data. Two strategies with consequences are caching and horizontal scaling. So look to vertical scaling or efficiencies in data modeling first.

I like your comment btw. I’d add Observability to CAP to incorporate what you’re saying.

cortesoft · 1h ago

> If your application requires caching to function effectively then you have a core issue that needs to be resolved, and if you don't address that issue then caching will become the problem eventually as your application grows more complex and active.

I don’t think this is always true. Sometimes your app simply has data that takes a lot of computation to generate but doesn’t need to be generated often. Any way you solve this is going to be able to be described as a ‘cache’ even if you are just storing calculations in your main database. That doesn’t mean your application has a fundamental design flaw, it could mean your use case has a fundamental cache requirement.

simonw · 3h ago

A friend of mine once argued that adding a cache to a system is almost always an indication that you have an architectural problem further down the stack, and you should try to address that instead.

The more software development experience I gain the more I agree with him on that!

jedberg · 25m ago

If you have no cache, and your first thought is "this needs a cache", you're probably right. Chances are you need to optimize a query or storage pattern. But you're thinking like an engineer. It may be true that there is a "more correct" engineering solution, but adding a cache might be the most expedient solution.

But after you've done all the optimizations, there is still a use case for caches. The main one being that a cache holds a hot set of data. Databases are getting better at this, and with AI in everything, latency of queries is getting swamped by waiting for the LLM, but I still see caches being important for decades to come.

hinkley · 1h ago

When all else fails, use caches. If all else hasn’t failed, it will once you use caches.

IgorPartola · 2h ago

If you think of it as a cache, yes. If you think of it as another data layer then no.

For example, let’s say that every web page your CMS produces is created using a computationally expensive compilation. But the final product is more or less static and only gets updated every so often. You can basically have your compilation process pull the data from your source of truth such as your RDBMS but then store the final page (or large fragments of it) in something like MongoDB. In other words the cache replacement happens at generation time and not on demand. This means there is always a cached version available (though possibly slightly stale), and it is always served out of a very fast data store without expensive computation. I prefer this style of caching to on demand caching because it means you avoid cache invalidation issues AND the thundering herd problem.

Of course this doesn’t work for every workflow but it can get you quite far. And yes this example can also be sort of solved with a static site generator but look beyond that at things like document fragments, etc. This works very well for dynamic content where the read to write ratio is high.
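
A minimal sketch of the generation-time approach, with assumptions stated up front: two plain dicts stand in for the source of truth and the fast read store, `PageStore` and its methods are hypothetical names, and the page is recompiled synchronously on every write. A real system might instead recompile in a background job, which is where the "slightly stale" window would come from.

```python
class PageStore:
    """Compile on write, serve precomputed pages on read (sketch)."""

    def __init__(self, render):
        self.render = render    # the expensive compilation step
        self.source = {}        # stand-in for the source of truth
        self.rendered = {}      # stand-in for the fast read store
        self.render_calls = 0

    def update(self, page_id, content):
        # Writes go to the source of truth, then the compiled page
        # is replaced immediately, not on the next read.
        self.source[page_id] = content
        self.render_calls += 1
        self.rendered[page_id] = self.render(content)

    def get(self, page_id):
        # Reads never trigger compilation, so there is no miss path
        # for many concurrent readers to stampede through.
        return self.rendered[page_id]
```

The cost of compilation is paid once per write, so the trade only makes sense when reads far outnumber writes.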

hinkley · 1h ago

No.

It’s not a data layer, it’s global shared state. Global shared state always has consequences. Sometimes the consequences are worth the trouble. But it is trouble.

If you think about Source of Truth, System of Record, cache is neither of those, and sits between them. There’s a lot of problems you can fix instead by improving the SoT or SoR situation in that area of the code.

convolvatron · 39m ago

in particular, the database already _has_ a cache. usually it's on the other side of the evaluation, at the block layer. which means that you have to pay a cost to get to it (the network protocol, and the evaluation).

if you use materialized views, that surfaces exactly what you want in a cache, except here the view's consistency with the underlying data is maintained. that's hugely important.

that leaves us with the protocol. prepared statements might help. now we really should be about the same as the bump-on-the-wire cache. that doesn't get us the same performance as the in-process cache. but we didn't have to sacrifice any performance or add any additional operational overhead to get it.

lemmsjid · 1h ago

Quite agree, this is how I explain it to people. When you think of cache as another derived dataset then you start to realize that the issues caches bring to architectures are often the result of not having an agreement between the business and engineering on acceptable data consistency tolerances. For example, outside the world of caching, if you email users a report, and the data is embedded in the email, then you are accepting that the user will see a snapshot of data at a particular time. In many cases this is fine, even preferred. Sometimes not, and you link the user to a realtime dashboard instead.

Pretty much every view the user sees of data should include an understanding as to how consistent that data is with the source of truth. Issues with caching (besides basic bugs) often come up when a performance issue comes up and people slap in a cache without renegotiating how the end user would expect the data to look relative to its upstream state.

hinkley · 58m ago

The cache is an incomplete dataset by definition. It’s not a data set, it’s a cache of a data set. You can never ensure you get a clean read of the system state from the cache because it’s never in sync and has gaps.

chamomeal · 2h ago

I already typed a longer comment elsewhere that I don’t feel like reiterating but I agree with you. Caching is a natural outcome of not having infinite time and memory for running programs. Sometimes it’s a bandaid over bad design, but often it’s a responsible decision to take load off of other important systems

cpursley · 2h ago

Lost me at DumpsterFireDB as cache. But if the goal is to create an even worse architecture that's even harder to maintain, go for it.

IgorPartola · 1h ago

Sorry you lack the imagination to substitute your preferred data store into what I wrote. Hope it gets easier.

cpursley · 1h ago

I'll never have enough imagination to believe mongo is a good solution. Postgres has jsonb, vector type; redis is a fine-enough cache. Why use a known junk "database" when there are superior solutions and truly open source?

DrBazza · 3h ago

I'd argue the database falls into that category.

The two questions no one seems to ask are 'do I even need a database?', and 'where do I need my database?'

There are alternate data storage 'patterns' that aren't databases. Though ultimately some sort of (Structured) query language gets invented to query them.

jitl · 3h ago

Yeah my architecture problem is that Postgres RDS EBS storage is slow as dog. Sure our data won’t go poof if we lose an instance but it’s so slow.

(It’s not really my architecture problem. My architecture problem is that we store pages as grains of sand in a db instead of in a bucket, and that we allow user defined schemas)

barrkel · 3h ago

Caches suck because invalidation needs to be sprinkled all over the place in what is often an abstraction-violating way.

Then there's memoization, often a hack for an algorithm problem.

I once "solved" a huge performance problem with a couple of caches. The stain of it lies on my conscience. It was actually admitting defeat in reorganizing the logic to eliminate the need for the cache. I know that the invalidation logic will have caused bugs for years. I'm sure an engineer will curse my name for as long as that code lives.

jmull · 3h ago

That's true in my experience.

Caches have perfectly valid uses, but they are so often used in fundamentally poor ways, especially with databases.

AtheistOfFail · 3h ago

I disagree. For large search pages where you're building payloads from multiple records that don't change often, it could be beneficial to use a cache. Your cache ends up helping the most common results to be fetched less often and return data faster.

tengbretson · 3h ago

Maybe these distinctions are useful to people in some situations, but to me this reads like wondering whether we can replace houses with buildings.

jayd16 · 2h ago

More like they're stocking the fridge and wondering what living next to the market is like.

eatonphil · 2h ago

Many of these points are not compelling to me when 1) you can filter both rows and columns (in postgres logical replication anyway [0]) and 2) SQL views.

[0] https://www.postgresql.org/docs/current/logical-replication-...

avinassh · 2h ago

Is it possible to create a filter that can work over a complex join operation?

That's what IVM systems like Noria can do. With application + cache, the application stores the final result in the cache. So, with these new IVM systems, you get that precomputed data directly from the database.

Views in Postgres are not materialized right? so every small delta would require refresh of entire view.

hoppp · 4h ago

The cache service is a database of sorts that usually stores key value pairs.

The difference is in persistence and scaling and read/write permissions

barrkel · 3h ago

No, what makes a cache a cache is invalidation. A cache is stale data. It's a latent out of date calculation. It's misinformation that risks surviving until it lies to the user.

jedberg · 20m ago

This is true but a lot of the trouble in invalidation can be avoided by using smarter cache keys.

For example, on reddit, fully rendered comments are cached, so that the renderer doesn't have to redo its work. But the cache key includes the date of the last edit on the comment, which is already known when requesting the value from the cache. In this way, you never have to invalidate that key, because editing the comment makes a new key. The old one will just get ejected eventually.
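
A minimal sketch of that key scheme, not reddit's actual code: `RenderedCommentCache`, its `get` signature, and the small LRU bound are assumptions for illustration. The key is the pair `(comment_id, last_edited)`, so an edit produces a new key and no explicit invalidation call is needed.

```python
from collections import OrderedDict


class RenderedCommentCache:
    """Cache rendered comments under (comment_id, last_edited)."""

    def __init__(self, render, max_entries=1024):
        self.render = render
        self.max_entries = max_entries
        self.entries = OrderedDict()   # ordered oldest-used first
        self.render_calls = 0

    def get(self, comment_id, last_edited, body):
        # The caller already knows last_edited when it asks, so an
        # edited comment simply maps to a different key.
        key = (comment_id, last_edited)
        if key in self.entries:
            self.entries.move_to_end(key)   # mark as recently used
            return self.entries[key]
        self.render_calls += 1
        html = self.render(body)
        self.entries[key] = html
        if len(self.entries) > self.max_entries:
            # Keys for old edits are never invalidated; they just
            # age out as the least recently used entries.
            self.entries.popitem(last=False)
        return html
```

The trade-off is that stale versions occupy space until eviction, which is usually cheaper than getting invalidation wrong.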

Supermancho · 3h ago

ie A cache is a database. The difference is features and usage.

hinkley · 54m ago

A database is usually a union of all of the questions that can be asked about a topic. A cache by definition is a subset of that. Subsets are not the sets. And if you treat them as if they are, which 90% of people do, you’re gonna have a bad time.

jamesblonde · 1h ago

Some of these questions are informed by the Redis/DynamoDB or Postgres/MySQL world the author seems to inhabit.

Why would you want to do this? "I don’t know of any database built to handle hundreds of thousands of read replicas constantly pulling data."

If you want an open-source database with Redis latencies to handle millions of concurrent reads, you can use RonDB (disclaimer, I work on it).

"Since I’m only interested in a subset of the data, setting up a full read replica feels like overkill. It would be great to have a read replica with just partial data."

This is very unclear. Redis returns complete rows because it does not support pushdown projections or ordered indexes. RonDB supports these and distribution-aware partition-pruned index scans (start the transaction on the node/partition that contains the rows that are found with the index).

Reference:

https://www.rondb.com/post/the-process-to-reach-100m-key-loo...

xixixao · 3h ago

This is a good deep dive into the complexity around caching: https://stack.convex.dev/caching-in

Having caching by default (like in Convex) is a really neat simplification to app development.

gethly · 1h ago

Event-sourcing is a powerful tool that helps with exactly this. Why spin up a cache server when you can spin up another read DB instance for the same price and get unlimited capabilities...

jayd16 · 2h ago

So I guess this guy wants Firestore (or the OSS equivalent)?

cbsmith · 4h ago

So close to getting push driven architecture...

phoronixrly · 4h ago

Rails also has a take on this https://github.com/rails/solid_cache

Replacing a Cache Service with a Database

Comments (40)