Show HN: Hydra (YC W22) – Serverless Analytics on Postgres
Traditionally, this was infeasible: Postgres is a rowstore database that’s 1000X slower at analytical processing than a columnstore database.
(A quick refresher for anyone interested: a rowstore means table rows are stored sequentially, making it efficient to insert or update a record, but inefficient to filter and aggregate data. At most businesses, analytical reporting scans large volumes of events, traces, and time-series data. As the volume grows, the inefficiency of the rowstore compounds: it doesn't scale for analytics. In contrast, a columnstore stores all the values of each column in sequence, so an analytical query only has to read the columns it actually touches.)
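To make that concrete, here's a hypothetical analytical query over an events table (the table and column names are invented for illustration). It only needs two columns, which is exactly the access pattern a columnstore serves well and a rowstore can only answer by scanning every full row:

    -- Hypothetical events table; the query reads only occurred_at and user_id,
    -- yet a rowstore must scan entire rows (every column) to answer it.
    SELECT date_trunc('day', occurred_at) AS day,
           count(DISTINCT user_id)        AS daily_active_users
    FROM   events
    GROUP  BY 1
    ORDER  BY 1;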
For decades, businesses had to manage the relative strengths of the rowstore and the columnstore by maintaining two separate systems. This led to large gaps in functionality, in syntax, and in the background knowledge required of engineers. For example, here are the gaps between Redshift (a popular columnstore) and Postgres (rowstore) features: (https://docs.aws.amazon.com/redshift/latest/dg/c_unsupported...).
We think there’s a better, simpler way: unify the rowstore and columnstore – keep the data in one place and drop the cost and hassle of managing an external analytics database. With Hydra, events, traces, time-series data, user sessions, clickstream, IoT telemetry, etc. are accessible as a columnstore right alongside your standard rowstore tables.
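As a minimal sketch of what this looks like (assuming Hydra exposes its columnar table access method as USING columnar, the syntax used by Hydra's open-source columnar extension), a columnstore table can live right next to a regular heap table:

    -- Regular rowstore (heap) table for transactional data.
    CREATE TABLE users (
        id    bigint GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
        email text NOT NULL,
        plan  text
    );

    -- Columnstore table for high-volume event data.
    CREATE TABLE events (
        user_id     bigint,
        event_type  text,
        occurred_at timestamptz
    ) USING columnar;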
Our solution: Hydra separates compute from storage to bring a columnstore, serverless query processing, and automatic caching to your Postgres database.
The term "serverless" can be a bit confusing, because a server always exists; it means compute is ephemeral and is spun up and down automatically. The database automatically provisions and isolates dedicated compute resources for each query process. Serverless is different from managed compute, where the user explicitly allocates and scales CPU and memory, keeps them running continuously, and potentially overpays during idle time.
How is serverless useful? It's important that every analytics query gets its own resources per process. The major hurdles to running analytics on Postgres are 1) rowstore performance and 2) resource contention. #2 is very often overlooked, but in practice, analytics queries tend to hog resources (RAM and CPU) from Postgres transactional work. So a slightly expensive analytics query can slow down the entire database. That's why serverless matters: it guarantees that expensive queries are isolated and run on dedicated database resources per process.
Why is Hydra so fast at analytics? (https://tinyurl.com/hydraDBMS) 1) Columnstore by default, 2) metadata for efficient file-skipping and retrieval, 3) parallel, vectorized execution, 4) automatic caching.
What’s the killer feature? Hydra can quickly join columnstore tables with standard row tables inside Postgres using plain SQL.
Example: “Segment events as a table.” Instead of dumping Segment event data into an S3 bucket or an external analytics database, use Hydra to store events (clicks, signups, purchases) and join them with user profile data inside Postgres. Know your users in real time: “What events predict churn?” or “Which users are likely to convert?” becomes immediately actionable (see the sketch below).
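For instance, a churn-flavored question over the hypothetical events and users tables above could be a single join (the event names and columns are invented for the example):

    -- Join columnstore events with rowstore user profiles in one query.
    SELECT u.plan,
           count(*) FILTER (WHERE e.event_type = 'cancel_click') AS cancel_clicks,
           count(*) FILTER (WHERE e.event_type = 'purchase')     AS purchases
    FROM   events e
    JOIN   users u ON u.id = e.user_id
    WHERE  e.occurred_at > now() - interval '30 days'
    GROUP  BY u.plan;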
Thanks for reading! We'd love to hear your feedback, and if you'd like to try Hydra now, we offer a $300 credit and 14 days free per account. We're excited to see how bringing the columnstore and rowstore side by side can help your project.
> Visit http://platform.hydra.so/token to fetch the access token and paste it into the section above.
1- What data is shared with Hydra in this case?
2- What's the pricing for the bare-metal deployment?
Having followed the project for a while now, I really scratch my head when looking at your pricing.
The entire innovation of the past decade in database land has gone towards decoupling storage and compute, driving query engines (like DuckDB) and file formats (like Iceberg).
Yet you force-bundle storage and compute in your pricing while also selling a serverless product.
What's the reason behind that?
Why do it in the first place?
How does your pricing work?
Are the 40/500 compute hours I get included in the spend limit per tier (i.e. max 160 additional hours in Starter, etc.), or are they completely separate?
Why are there member constraints on a database product?
How does that factor into cost / map to SDL / reasonable team setups for people operating analytics projects revolving around a database like yours?
I have never seen such a limit with any other vendor, and especially when you want to get a foothold in the market / have people start using Hydra for the specialized role it can provide, a 2-person limit on the minimum tier would likely be a showstopper if I wanted to PoC this, tbh...
One of the downsides of serverless is that it can be difficult to predict the overall monthly cost when the granularity of billing (per invocation, memory usage, or execution time) is complex. For developers this might be totally fine (even preferred), but we think a single, predictable price (Hydra at $100/month) is easier for businesses to plan around.
Usage caps per plan are purely soft limits, so users don't actually encounter them. Yes, we want people to upgrade to higher plans. In the words of Maya Angelou, "Be careful when a naked person offers you a shirt" - meaning we believe these are the best prices we can offer today while building a sustainable project. That said, I appreciate your point about our limit on the number of users. If we removed that limit, would you try out Hydra?
We currently use AWS Aurora. How easy would it be to simply SQL-dump and load into Hydra, and how well would it serve as a drop-in replacement?
We initially set the rowstore as the default, but people wouldn't create columnstore tables and were confused about why performance wasn't improving. So we figured this was cleaner, but you always have the option to switch the default table type back.
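As a rough sketch of what switching the default back could look like (assuming Hydra drives this through the standard Postgres default_table_access_method setting, with "columnar" as the name of its access method; "heap" is Postgres's built-in rowstore and mydb is a placeholder database name):

    -- Check which table type new tables get by default.
    SHOW default_table_access_method;

    -- Make new tables rowstore (heap) again for the current session...
    SET default_table_access_method = 'heap';

    -- ...or persist the change for a whole database.
    ALTER DATABASE mydb SET default_table_access_method = 'heap';

    -- Individual tables can still opt into the columnstore explicitly.
    CREATE TABLE events_archive (LIKE events) USING columnar;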