Just make it scale: An Aurora DSQL story

134 cebert 40 5/27/2025, 11:31:02 AM allthingsdistributed.com ↗

Comments (40)

Demiurge · 111d ago

Many interesting things, for instance, I've been hearing a lot about how fast Java is, that it can be as fast as C++, and then I see this:

> But after a few weeks, it compiled and the results surprised us. The code was 10x faster than our carefully tuned Kotlin implementation – despite no attempt to make it faster. To put this in perspective, we had spent years incrementally improving the Kotlin version from 2,000 to 3,000 transactions per second (TPS). The Rust version, written by Java developers who were new to the language, clocked 30,000 TPS.

I feel like there is more to this, like some kind of a bottleneck, memory footprint, some IO overhead?

> Our conclusion was to rewrite our data plane entirely in Rust.

The point is well taken, figuring it out is not worth it, if you can just "rewrote" or have green field projects.

> These extension points are part of Postgres’ public API, allowing you to modify behavior without changing core code

Also, interesting. So PostgreSQL evolved to the point that it has a stable API for extensibility? This great for the project, maintain a modular design, and some stable APIs and, you can let people mix and match and reduce duplication of effort.

anarazel · 111d ago

> So PostgreSQL evolved to the point that it has a stable API for extensibility?

Not across major versions, no. I seriously doubt we will ever make promises around that. It would hamper development way too much.

Demiurge · 111d ago

I see, then they're probably saying they found the internal APIs that are just more naturally stable, perhaps because they are close to the APIs used for extensions.

ramanh · 110d ago

> I feel like there is more to this, like some kind of a bottleneck, memory footprint, some IO overhead?

blocking/nonblocking IO can explain this numbers

kondro · 111d ago

It would be really great to get more context on what a DPU is for pricing: https://aws.amazon.com/rds/aurora/pricing/

I understand that AWS did one TPC-C 95/5 read/write benchmark and got 700k transactions for 100k DPUs, but that’s not nearly enough context.

There either needs to be a selection of other benchmark-based pricing (especially for a primarily 50/50 read/write load), actual information on how a DPU is calculated or a way to return DPU per query executed, not just an aggregate CloudWatch figure.

We were promised DSQL pricing similar to DynamoDB and insofar as it’s truly serverless (i.e. no committed pricing) they’ve succeeded, but one of the best parts of DynamoDB is absolute certainty on cost, even if that can sometimes be high.

belter · 111d ago

> one of the best parts of DynamoDB is absolute certainty on cost

That depends if its On Demand or Provisioned, even if they recently added On Demand limits.

kondro · 111d ago

You still have absolute certainty. Read or write x amount of data and it will use exactly y R/WCU.

It then just becomes a modeling problem allowing you to determine your costs upfront during design. That’s one of the most powerful features of the truly serverless products in AWS in my opinion.

ejkra · 110d ago

Absolute certainty is challenging with a cost-based optimizer in the mix. DDB doesn't face this challenge. Although, cost for some query patterns in DDB would shift into your application layer - so you may not have exactly the cost certainty you imagine?

Would you be willing to pay more for certainty? E.g. rent the full server at peak + 20% and run at 15% utilization some of the time? Provisioned capacity or pre-committed spend seem like reasonable, but perhaps more costly, ways to get certainty.

glzone1 · 111d ago

Early dsql had some weird limits I think - anyone actually using in production with feedback on current corners and limits?

Marbling4581 · 111d ago

I don't use it, but have been keeping an eye on it.

At launch, they limited the number of affected tuples to 10000, including tuples in secondary indexes. They recently changed this limit to:

> A transaction cannot modify more than 3,000 rows. The number of secondary indexes does not influence this number. This limit applies to all DML statements (INSERT, UPDATE, DELETE).

There are a lot of other (IMO prohibitive) restrictions listed in their docs.

https://docs.aws.amazon.com/aurora-dsql/latest/userguide/wor...

mjb · 111d ago

Which features would you like to see the team build first? Which limits would you like to see lifted first?

Most of the limitations you can see in the documentation are things we haven't gotten to building yet, and it's super helpful to know what folks need so we can prioritize the backlog.

avereveard · 111d ago

indexes! vector, trigram and maybe geospatial. (some may be in by now I didn't follow the service as closely as others)

note, doesn't have to be pg_vector pg_trgm or PostGIS, just the index component even if it's a clean room implementation would make this way more useful.

tomComb · 111d ago

The lack of JSONB is what stopped me.

loginatnine · 111d ago

Views and foreign keys!

mjb · 111d ago

Thanks. The team's working on both. For views, do you need updatable views, or are read-only views sufficient?

loginatnine · 111d ago

For me it's RO views.

tigy32 · 111d ago

I believe views were added to the preview a little while ago

edit from the launch: "With today’s launch, we’ve added support for AWS Backup, AWS PrivateLink, AWS CloudFormation, AWS CloudTrail, AWS KMS customer managed keys, and PostgreSQL views."

mjb · 111d ago

Correct: https://docs.aws.amazon.com/aurora-dsql/latest/userguide/wor...

sgarland · 111d ago

Why does it not support TRUNCATE?

jashmatthews · 111d ago

My understanding is the way Aurora DSQL distributes data widely makes bulk writes extremely slow/expensive. So no COPY, INSERT with >3k rows, TRUNCATE etc

sgarland · 111d ago

TRUNCATE is DROP TABLE + CREATE TABLE, it’s not a bulk delete. It bypasses the typical path for writes entirely.

pzduniak · 111d ago

Who would use Preview products in production? I'm building out some software that would fit perfectly into the constraints set for DSQL, but I realistically can't commit to something with no pricing / guarantees.

EwanToo · 111d ago

This blog post appears to be part of the scheduled launch marketing, it's now generally available

https://aws.amazon.com/blogs/aws/amazon-aurora-dsql-is-now-g...

loevborg · 111d ago

Which ones? It seems eminently usable from the outside now, at least for greenfield work. The subset of Postgres it supports is most of good/core/essential Postgres. (But I haven't tried it)

geodel · 111d ago

Good read. I like the part that both writing low level as well as high level component in Rust was proven worthwhile.

Maybe one can transform slow code from high level languages to low level language via LLMs in future. That can be nice performance boost for those who don't have Amazon engineers and budgets

mjb · 111d ago

> Maybe one can transform slow code from high level languages to low level language via LLMs in future.

This is one of the areas I'm most excited for LLM developer tooling. Choosing a language, database, or framework is a really expensive up-front decision for a lot of teams, made when they have the least information about what they're building, and very expensive to take back.

If LLM-powered tools could take 10-100x off the cost of these migrations, it would significantly reduce the risk of early decisions, and make it a ton easier to make software more reliable and cheaper to run.

It's very believable to me that, even with today's model capabilities, that 10-100x is achievable.

geodel · 111d ago

I remember many years back one of Go language author wrote C to Go trasformer and used that to convert all compiler, runtime, GC etc into Go.

Now in today's time some experts like above could create base transformer for high level language and frameworks to low level language and frameworks and this all get exposed via llm interfaces.

One can say why all this instead of generating fast binary directly from high level code. But generating textual transformation would give developers opportunity to understand, tweak and adjust transformed code which generating direct binary would not.

SahAssar · 111d ago

> Maybe one can transform slow code from high level languages to low level language

I think you are describing a compiler?

geodel · 111d ago

I mean reading this article:

1) Kotlin code --> Java byte code --> JVM execution (slow)

2) Kotin code --> Rust/Zig code --> Zig compiler --> native execution (fast)

Compiler is involved in both cases but I was thinking of 2) where slower code in high level lang is converted to another lang code. The compiler of which is known to produce fast runinng code.

dhosek · 111d ago

You’re describing a transpiler, but the problem is that idioms in a GC language like Kotlin don’t necessarily translate to a non-GC language like Rust or Zig. Add in the fact that Rust doesn’t have OO inheritance which is essential for a lot of JVM code to work (I don’t know much about Zig) and I’d be very suspicious of code generated by a Kotlin to Rust transpiler. (On the other hand, one of the first transpilers I ever encountered, web2c, worked well because the source language, Pascal, could be fairly easily translated into functional C without much if any sacrifice of speed or accuracy.)

bee_rider · 111d ago

Python -> C -> Assembly

Probably looks a lot like

Pseudocode -> C -> Assembly

Although the first is easier to run tests on and compare the outputs.

mrkeen · 110d ago

Where can I go to read about distributed SQL and big JOINs or WHERE IN clauses? I was hoping this article would cover that elephant in the room, rather than Rust being significantly more performant than JVM languages.

louis-paul · 110d ago

Marc Brooker has written and spoken about DSQL quite a bit. It’s still rather high level. I’d expect one or more papers to come out in the next few months, similarly to other Amazon databases.

https://brooker.co.za/blog/2025/04/17/decomposing.html (includes talk)

https://brooker.co.za/blog/2024/12/03/aurora-dsql.html

https://brooker.co.za/blog/2024/12/04/inside-dsql.html

https://brooker.co.za/blog/2024/12/05/inside-dsql-writes.htm...

https://brooker.co.za/blog/2024/12/06/inside-dsql-cap.html

https://brooker.co.za/blog/2024/12/17/occ-and-isolation.html

mrkeen · 104d ago

That's a lot of links for 0 info on distributed JOIN or WHERE IN.

karl_p · 111d ago

The JVM can relocate memory to avoid fragmentation. Rust can't, at least natively. Are they not worried about this regression?

geodel · 111d ago

Well Java need it because it fragments memory a lot. With Rust one has value types and stack allocation which takes care of one of the biggest cause of fragmentation.

kikimora · 111d ago

Writing code that would not fragment memory over time is arguable much harder than writing GC friendly code.

tigy32 · 111d ago

I haven't found that to be the case in my experience: just for example in java you tend to end up with essentially a lot of `Vec<Box<Thing>>` which causes a lot of fragmentation. In rust you tend to end up with `Vec<Thing>` where `Thing`s are inlined. (And replace Vec with the stack for the common case). I find it more like Java is better at solving a problem it created by making everything an object.

geodel · 111d ago

Yeah, cooking food in kitchen is much harder than having it delivered from restaurant at doorstep.

Reasonable people will see if cost makes it worthwhile.

mrkeen · 110d ago

With 10x the throughput (TPS) and the lack of GC pauses (which were the cause of the rewrite), how would they measure such a regression, let alone worry about it?

Hosting a website on a disposable vape (bogdanthegeek.github.io)

William Gibson Reads Neuromancer (2004) (bearcave.com)

React is winning by default and slowing innovation (lorenstew.art)

macOS Tahoe (apple.com)

Linux phones are more important now than ever (feddit.org)

Wanted to spy on my dog, ended up spying on TP-Link (kennedn.com)

Addendum to GPT-5 system card: GPT-5-Codex (openai.com)

I feel Apple has lost its alignment with me and other long-time customers (morrick.me)

PayPal to support Ethereum and Bitcoin (newsroom.paypal-corp.com)

GPT-5-Codex (openai.com)

How big a solar battery do I need to store all my home's electricity? (shkspr.mobi)

The Rising Sea: Foundations of Algebraic Geometry Notes (math.stanford.edu)

Massive Attack turns concert into facial recognition surveillance experiment (gadgetreview.com)

Launch HN: Trigger.dev (YC W23) – Open-source platform to build reliable AI apps

Debian Upgrade Marathon: 3.1 Sarge (wrongthink.link)

I wish my web server were in the corner of my room (2022) (interconnected.org)

From unit tests to whole universe tests (with will wilson of antithesis) [video] (youtube.com)

People Who Hunt Down Old TVs (bbc.com)

Show HN: Pooshit – Sync local code to remote Docker containers

CubeSats are fascinating learning tools for space (jeffgeerling.com)

When Your Father Is a Magician, What Do You Believe? (thereader.mitpress.mit.edu)

GPT‑5-Codex and upgrades to Codex (simonwillison.net)

How to self-host a web font from Google Fonts (blog.velocifyer.com)

The Mac App Flea Market (blog.jim-nielsen.com)

Boring work needs tension (iaziz786.com)

Removing newlines in FASTA file increases ZSTD compression ratio by 10x (log.bede.im)

GuitarPie: Electric Guitar Fretboard Pie Menus (andreasfender.com)

The Revised Report on Scheme or An UnCommon Lisp (1985) [pdf] (dspace.mit.edu)

Scryer Prolog Meetup 2025 (hsd-pbsa.de)

RustGPT: A pure-Rust transformer LLM built from scratch (github.com)

Turgot Map of Paris (en.wikipedia.org)

Death to type classes (jappie.me)

A qualitative analysis of pig-butchering scams (arxiv.org)

Which NPM package has the largest version number? (adamhl.dev)

Asciinema CLI 3.0 rewritten in Rust, adds live streaming, upgrades file format (blog.asciinema.org)

Self-Assembly Gets Automated in Reverse of 'Game of Life' (quantamagazine.org)

Show HN: AI-powered web service combining FastAPI, Pydantic-AI, and MCP servers (github.com)

Active NPM supply chain attack: Tinycolor and 40 Packages Compromised (socket.dev)

A string formatting library in 65 lines of C++ (riki.house)

Show HN: Daffodil – Open-Source Ecommerce Framework to connect to any platform (github.com)

Human writers have always used the em dash (theringer.com)

Apple has a private CSS property to add Liquid Glass effects to web content (alastair.is)

NASA's Guardian Tsunami Detection Tech Catches Wave in Real Time (jpl.nasa.gov)

Researchers revive the pinhole camera for next-gen infrared imaging (phys.org)

Linux for Nintendo 64 (1997) (web.archive.org)

The Culture novels as a dystopia (boristhebrave.com)

Show HN: Semlib – Semantic Data Processing (github.com)

PythonBPF – Writing eBPF Programs in Pure Python (xeon.me)

Not all browsers perform revocation checking (revoked-isrgrootx1.letsencrypt.org)

Varnish Cache to be renamed Vinyl Cache project (varnish-cache.org)

Just make it scale: An Aurora DSQL story

Comments (40)