ClickHouse matches PG for single-row UPDATEs and is 4000x faster for bulk UPDATEs

65 points by truth_seeker | 25 comments | 8/17/2025, 5:52:43 PM | clickhouse.com ↗

Comments (25)

jedberg · 1h ago
> Caveat: PostgreSQL is fully transactional by default; ClickHouse isn’t. Results compare each engine’s native execution model, not identical transaction guarantees.

I appreciate that they call this out. This means that if you're using Postgres primarily to store data where it's OK to lose a few rows (like time series data), then Clickhouse is the clear and obvious choice.

But if transactionality is key for you (like a financial application, or the data backing an application), then Postgres still makes more sense.

compumike · 21m ago
PostgreSQL defaults to: "fsync = ON" and "synchronous_commit = ON".

ClickHouse defaults to: "fsync_after_insert = false".

A more fair comparison might at least do "SET synchronous_commit TO OFF" on the PostgreSQL side. (But for this UPDATE benchmark, I think the results would still largely be the same.)
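
A rough sketch of how you could level those durability settings on each side (the ClickHouse table name here is hypothetical; fsync_after_insert is a MergeTree table setting):

    -- PostgreSQL: relax per-commit durability for the session (or cluster-wide via ALTER SYSTEM)
    SET synchronous_commit TO OFF;

    -- ClickHouse: opt in to an fsync after each insert on a given MergeTree table
    ALTER TABLE events MODIFY SETTING fsync_after_insert = 1;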

oulipo · 16m ago
Since Postgres is open-source and the kind of database model it uses has been around for so long, how come ClickHouse can't have a "postgres subset" (or just include a mini-Postgres) so that it gets the best of both worlds out of the box?
joshstrange · 51m ago
I only use CH for work so I'll read about this more on Monday, but I shudder to think of the caveats. We have used cancelling rows and now one of the merge engines that just needs a version (higher cancels out lower). No database has ever driven me more mad than Clickhouse. If your workload is append-only/insert-only then congrats, it's amazing, you'll have a great time. If you need to update data... Well, strap in.
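
For readers who haven't hit these: a minimal sketch of the two patterns being described, with hypothetical table and column names. CollapsingMergeTree folds rows whose sign column cancels out; ReplacingMergeTree keeps the row with the highest version at merge time.

    -- Cancelling rows: -1 rows cancel earlier +1 rows during background merges
    CREATE TABLE user_stats_collapsing
    (
        user_id UInt64,
        score   Int64,
        sign    Int8              -- +1 = state row, -1 = cancel row
    )
    ENGINE = CollapsingMergeTree(sign)
    ORDER BY user_id;

    -- Version-merging: the highest version per key wins when parts merge
    CREATE TABLE user_stats_replacing
    (
        user_id UInt64,
        score   Int64,
        version UInt64
    )
    ENGINE = ReplacingMergeTree(version)
    ORDER BY user_id;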

As long as you can get away with Postgres, stay with Postgres. I'm sure this update here is a step forward just like version-merging is much better than cancelling rows but it's always got a ton of downsides.

Unrelated to updating data, the CH defaults drive me insane, the null join behavior alone made me reconsider trying to rip CH out of our infrastructure (after wasting too long trying to figure out why my query "wasn't working").
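
(The join behavior in question is, as far as I can tell, the join_use_nulls setting: it defaults to 0, so outer joins fill non-matched columns with type defaults such as 0 or '' instead of NULL. Something like this makes it behave more like Postgres:)

    SET join_use_nulls = 1;   -- outer JOINs now produce NULLs rather than type defaults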

Lastly I'll say, if CH does what you need and you are comfortable learning all the ins and outs, then it can do some really cool things. But it's important to remember it's NOT a normal RDBMS, nor can you use it like one. I almost wish they didn't use SQL as the query language, then people would think about it differently, myself included.

saisrirampur · 28m ago
Very interesting take — I see where you’re coming from. Yes, there are caveats and differences between ClickHouse and Postgres. Much of this stems from the nature of the workloads they are built for: Postgres for OLTP and ClickHouse for OLAP.

We’ve been doing our best to address and clarify these differences, whether through product features like this one or by publishing content to educate users. For example: https://clickhouse.com/blog/postgres-to-clickhouse-data-mode... https://www.youtube.com/watch?v=9ipwqfuBEbc.

From what we’ve observed, the learning curve typically ranges from a few weeks for smaller to medium migrations to 1–2 months for larger ones moving real-time OLAP workloads from Postgres to ClickHouse. Still, customers are making the switch and finding value — hundreds (or more) are using both technologies together to scale their real-time applications: Postgres for low-latency, high-throughput transactions and ClickHouse for blazing-fast (100x faster) analytics.

We’re actively working to bridge the gap between the two systems, with features like faster UPDATEs, enhanced JOINs and more. That’s why I’m not sure your comment is fully generalizable — the differences largely stem from the distinct workloads they support, and we’re making steady progress in narrowing that gap.

- Sai from the ClickHouse team here.

smarx007 · 14m ago
How much of the ISO/IEC 9075:2023 SQL standard does CH conform to?
oulipo · 14m ago
What would be the best Postgres + CH setup to combine both? Something using CDC to apply changes to CH?
saisrirampur · 5m ago
Great question. Exactly that: CDC from Postgres to ClickHouse, and adapting the application to start using ClickHouse for analytics. Through the PeerDB acquisition, ClickHouse now has native CDC capabilities that work at any scale (a few 10s of GB to 10s of TB Postgres databases). You can use ClickPipes if you're on ClickHouse Cloud, or PeerDB if you're using ClickHouse OSS.

Sharing a few links for reference: https://clickhouse.com/docs/integrations/clickpipes/postgres https://github.com/PeerDB-io/peerdb https://clickhouse.com/cloud/clickpipes/postgres-cdc-connect...

Here is a short demo/talk that we did at our annual conference, Open House, that covers this reference architecture: https://clickhouse.com/videos/postgres-and-clickhouse-the-de...
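
(For context, and with hypothetical names, the Postgres side of a logical-replication-based CDC pipeline like this is prepared roughly as follows:)

    -- Requires wal_level = logical (changing it needs a server restart)
    ALTER SYSTEM SET wal_level = 'logical';

    -- A publication covering the tables you want streamed to ClickHouse
    CREATE PUBLICATION ch_cdc FOR TABLE orders, users;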

YZF · 27m ago
You might want to think about converting your updates into some sort of event sourcing scheme where you insert new rows and then do aggregation. That pattern would be more appropriate for ClickHouse.
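
As a rough sketch of that pattern (table and column names are made up), you insert a new row per change and derive the latest state at query time:

    CREATE TABLE account_events
    (
        account_id UInt64,
        balance    Decimal(18, 2),
        updated_at DateTime
    )
    ENGINE = MergeTree
    ORDER BY (account_id, updated_at);

    -- Latest state per account, instead of updating rows in place
    SELECT
        account_id,
        argMax(balance, updated_at) AS current_balance
    FROM account_events
    GROUP BY account_id;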

If you need updates then perhaps ClickHouse isn't the perfect choice. Something like ScyllaDB might be a better compromise if you want performant updates with (some sort of) consistency guarantees. If you need stronger guarantees you will need a "proper" database, but then you're unlikely to get the performance. AKA tradeoffs, or no free lunch.

barrkel · 35m ago
CH is better for analytics, where append-only is the normal mode of operation, but I've used it in the past as an index. Store a copy of data in Clickhouse and use its vectorized columnar operations for ad hoc queries (the kind where indexes don't help because the user may query by any field they like). This can work well if your data is append-mostly and you do a rebuild yourself after a while, but the way it sounds, Clickhouse is making it possible to get that to work well with a higher ratio of updates.

Either way, CH shouldn't be the store of truth when you need record level fidelity.

mrits · 23m ago
I ported an entire analytic solution from SQL Server to clickhouse in a few months. While the workarounds for updates aren't great it didn't come as a surprise since I've used other similar databases. The joining/null behavior is called out in the documentation so that wasn't a surprise either.

CH has been my favorite database since I discovered PostgreSQL 20 years ago. My viewpoint is: don't use Postgres unless you can't use CH.

trollied · 44m ago
4000 times faster is hardly difficult if you’re not trying to be ACID compliant. I can write to a flat file that doesn’t give a shit about integrity very quickly - it’s not a boast.
orf · 33m ago
Interesting read, but I feel like they should have also benchmarked using COPY with Postgres. This should be far faster than a bulk insert, and it’s more in line with what they are benchmarking.

The omission feels… odd.
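
(For reference, Postgres' COPY bulk-load path looks roughly like this; table and file names are hypothetical:)

    -- Server-side COPY from a file readable by the postgres process
    COPY orders FROM '/tmp/orders.csv' WITH (FORMAT csv, HEADER true);

    -- Or client-side via psql's \copy, streaming over the connection
    \copy orders FROM 'orders.csv' WITH (FORMAT csv, HEADER true)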

sdairs · 29m ago
To be honest, I just didn't think of it. But thanks for the suggestion, we'll give it a go!
AdriaanvRossum · 2h ago
> TL;DR: PostgreSQL wraps every statement in a fully transactional context; ClickHouse doesn’t. That means these results aren’t a perfect measure of transaction performance. However, they’re still an interesting and relevant look at how each system handles update workloads under its native execution model.
ndriscoll · 1h ago
tl;dr Doing a bunch of synchronous writes and full table scans with extremely slow, high latency storage (for a dataset that easily fits in RAM) is slow.
samsk · 1h ago
My tl;dr is: Don't do huge updates using a WHERE on fields _not_ in an index, otherwise it will take days to finish and you get meaningless numbers in benchmarks...
hn-user-42 · 1h ago
I learned that the hard way. It was midnight, and I had to make some urgent changes to the user table in the production database. We had about 4 million users and needed to make changes to 70k of them (due to a change in a 3rd party).

I figured out after 40 minutes that we weren't making any progress. For each of those users we were probably scanning all 4 million rows. After I added an index on the specific field, it was done quickly.
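
Roughly what that fix looks like (table and column names are made up):

    -- Build the index without blocking writes, then the UPDATE can use an index scan
    CREATE INDEX CONCURRENTLY idx_users_external_ref ON users (external_ref);

    -- Check with EXPLAIN that the plan is no longer a sequential scan
    EXPLAIN UPDATE users SET needs_resync = true WHERE external_ref = 'abc-123';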

sdairs · 1h ago
One of the post authors here; I just want to stress that the goal of the post was not at all to be "haha we're better than postgres" - we explicitly call out the caveats behind the results.

Prior to July 2025 (ClickHouse v25.7), ClickHouse did not support UPDATE statements. At all. Like most columnar, analytics databases. We spent a lot of effort designing and implementing a way to support high-performance, SQL-standard UPDATE statements. This test was pretty much just trying to see how good of a job we had done, by comparing ourselves to the gold standard, Postgres. (If you're curious, we also wrote about how we built the UPDATE support in depth https://clickhouse.com/blog/updates-in-clickhouse-2-sql-styl...)
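
(That is, as of 25.7 a plain SQL-style UPDATE against a MergeTree table works; a hypothetical example:)

    UPDATE orders SET status = 'shipped' WHERE order_id = 12345;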

We have some updates to the post in progress; we originally and deliberately used cold runs for both ClickHouse & Postgres, because we wanted to look at the "raw" update speed of the engine rather than the variability of cache hits. But TL;DR: when you run a more "real world" test where caches are warm and Postgres is getting a very high cache-hit ratio, its point updates are consistently ~2ms, while ClickHouse is somewhere around ~6ms (bulk updates are still many multiples faster in ClickHouse even with the cache in play).

mdaniel · 2m ago
I see the Jepsen for Keeper (and my sincere gratitude for it!); have you considered Jepsen for CH itself?
mrlongroots · 51m ago
Another thought: this is not meant as criticism or anything, just an aspect of performance that I thought was interesting but not covered by the blog.

A test that would show PG's strengths over ClickHouse for OLTP would be a stress test with a long-running set of updates.

ClickHouse maintains updates as uncompacted patches merged in the background, which is how you would do it with a columnar store. But if you have an update-heavy workload, these patches would accumulate and your query performance would start to suffer. PG on the other hand completes all update work inline, and wouldn't get degrading performance under update-heavy regimes.

This is just a fundamental artifact of OLAP vs OLTP, maybe OLAP can be optimized to the point where it doesn't really matter for most workloads, but a theoretical edge remains with row-based stores and updates.

sdairs · 31m ago
I think we do allude to this in the post, if my understanding is correct. AFAIK both ClickHouse and Postgres are using MVCC, so when an UPDATE is run, a new version of the data is written and the original data is not changed. In ClickHouse, we write a patch part, which is merged automatically by a background process or by FINAL. In Postgres, you write a new tuple and mark the old one as dead, which will be cleaned up by a VACUUM.
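
A quick sketch of the two cleanup paths described above (hypothetical table name):

    -- ClickHouse: apply pending patch parts at read time with FINAL,
    -- or fold them in eagerly by forcing a merge
    SELECT count() FROM orders FINAL;
    OPTIMIZE TABLE orders FINAL;

    -- Postgres: dead tuples left behind by UPDATEs are reclaimed by VACUUM
    VACUUM (VERBOSE) orders;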

I'd wager the overhead in Postgres is probably lighter (though it's also light in ClickHouse), so you're right, this would be an interesting test. We actually had something like that planned, to run concurrent UPDATEs and SELECTs at different volumes for a period of time, to see how they each cope. Will definitely do it!

fiddlerwoaroof · 35m ago
I haven’t read that post yet. I assume you merge the updates into the main data files when you merge the segments of the log-structured merge tree?
samwillis · 1h ago
Really interesting post, and well done on calling out the apples-to-oranges nature of the benchmark; you're very clear. It's really interesting to see the difference the distinct architectures make.

Did you run any tests with the new transaction system in ClickHouse? It would be super interesting to see how it affected the batch updates.

sdairs · 56m ago
If you're referring to the experimental ACID transactions, we didn't test them; I'll check what state they're in and see if it's worth including yet. If not, we'll for sure come back to them later and do a similar test!