Will Amazon S3 Vectors Kill Vector Databases–Or Save Them?

30 Fendy 27 9/8/2025, 3:35:46 PM zilliz.com ↗

Comments (27)

simonw · 52m ago

This is a good article and seems well balanced despite being written by someone with a product that directly competes with Amazon S3. I particularly appreciated their attempt to reverse-engineer how S3 Vectors work, including this detail:

> Filtering looks to be applied after coarse retrieval. That keeps the index unified and simple, but it struggles with complex conditions. In our tests, when we deleted 50% of data, TopK queries requesting 20 results returned only 15—classic signs of a post-filter pipeline.

Things like this are why I'd much prefer if Amazon provided detailed documentation of how their stuff works, rather than leaving it to the development community to poke around and derive those details independently.

qaq · 32m ago

"I recently spoke with the CTO of a popular AI note-taking app who told me something surprising: they spend twice as much on vector search as they do on OpenAI API calls. Think about that for a second. Running the retrieval layer costs them more than paying for the LLM itself. That flips the usual assumption on its head." Hmm well start sending full documents as part of context see it flip back :).

heywoods · 18m ago

Egress costs? I’m really surprised by this. Thanks for sharing.

qaq · 1m ago

Sry maybe should've being more clear it was a sarcastic remark. The whole point of doing vector db search is to feed LLM with very targeted context so you can save $ on API calls to LLM.

redskyluan · 17m ago

Author of this article.

Yes, I’m the founder and maintainer of the Milvus project, and also a big fan of many AWS projects, including S3, Lambda, and Aurora. Personally, I don’t consider S3Vector to be among the best products in the S3 ecosystem, though I was impressed by its excellent latency control. It’s not particularly fast, nor is it feature-rich, but it seems to embody S3’s design philosophy: being “good enough” for certain scenarios.

In contrast, the products I’ve built usually push for extreme scalability and high performance. Beyond Milvus, I’ve also been deeply involved in the development of HBase and Oracle products. I hope more people will dive into the underlying implementation of S3Vector—this kind of discussion could greatly benefit both the search and storage communities and accelerate their growth.

redskyluan · 16m ago

By the way, if you’re not fully satisfied with S3Vector’s write, query, or recall performance, I’d encourage you to take a look at what we’ve built with Zilliz Cloud. It may not always be the lowest-cost option, but it will definitely meet your expectations when it comes to latency and recall.

scosman · 28m ago

Anyone interested in this space should look at https://turbopuffer.com - I think they were first to market with S3 backed vector storage, and a good memory cache in front of it.

storus · 48m ago

Does this support hybrid search (dense + sparse embeddings)? Pure dense embeddings aren't that great for specific search, they only hit meaning reliably. Amazon's own embeddings also aren't SOTA.

infecto · 47m ago

That’s where my mind was rolling and also if not, can this be used in OpenSearch hybrid search?

resters · 59m ago

By hosting the vectors themselves, AWS can meta-optimize its cloud based on content characteristics. It may seem like not a major optimization, but at AWS scale it is billions of dollars per year. It also makes it easier for AWS to comply with censorship requirements.

coredog64 · 18m ago

This comment appears to misunderstand the control plane/data plane distinction of AWS. AWS does have limited access to your control plane, primarily for things like enabling your TAMs to analyze your costs or getting assistance from enterprise support teams. They absolutely do not have access to your dataplane unless you specifically grant it. The primary use case for the latter is allowing writes into your storage for things like ALB access logs to S3. If you were deep in a debug session with enterprise support they might request one-off access to something large in S3, but I would be surprised if that were to happen.

resters · 16m ago

If that is the case why create a separate govcloud and HIPAA service?

barbazoo · 45m ago

> It also makes it easier for AWS to comply with censorship requirements.

Does it, how? Why would it be the vector store that would make it easier for them to censor the content? Why not censor the documents in S3 directly, or the entries in the relational database. What is different about censoring those vs a vector store?

resters · 36m ago

Once a vector has been generated (and someone has paid for it) it can be searched for and relevant content can be identified without AWS incurring any additional cost to create its own separate censorship-oriented index, etc. AWS can also add additional bits to the vector that benefit its internal goals (scalability, censorship, etc.)

Not to mention there is lock-in once you've gone to the trouble of using a specific embedding model on a bunch of content. Ideally we'd converge on backwards-compatible, open source approaches, but cloud vendors want to offer "value" by offering "better" embedding models that are not open source.

simonw · 29m ago

Why would they do that? Doesn't sound like something that would attract further paying customers.

Are there laws on the books that would force them to apply the technology in this way?

resters · 21m ago

Not official laws that we can read, but things like that are already in place per the Snowden revelations.

whakim · 16m ago

Regardless of the merits of this argument, dedicated vector databases are all running on top of AWS/GCP/Azure infrastructure anyways.

barbazoo · 33m ago

And that doesn't apply to any other database/search technology AWS offers?

resters · 22m ago

It does to some but not to most of it, which is why Azure and GCP offer nearly the exact same core services.

Fendy · 1h ago

what do you think?

sharemywin · 1h ago

it's annoying to me that there's not a doc store with vectors. seems like the vector dbs just store the vectors I think.

whakim · 28m ago

Elasticsearch and Vespa both fit the bill for this, if your scale grows beyond the purpose-built vector stores.

storus · 42m ago

Pinecone allows 40k of metadata with each vector which is often enough.

simonw · 27m ago

Elasticsearch and MongoDB Atlas and PostgreSQL and SQLite all have vector indexes these days.

intalentive · 1h ago

I just use sqlite

jeffchuber · 1h ago

chroma stores both

nkozyra · 1h ago

As does Azure's AI search.

Not caring enough about money?

Ask HN: How to take notes and learn from them?

Ask HN: Good resources for DIY-ish animatronic kits for Halloween?

Ask HN: How much can we trust open-source projects or our hardware?

Ask HN: Looking for headless CMS recommendation

Ask HN: Who wants to be hired? (September 2025)

Ask HN: How to avoid passive use of AI?

Ask HN: Can an amateur make contributions to pure math or theoretical physics?

Ask HN: Who is hiring? (September 2025)

Raku.org Chooses Htmx

Ask HN: Are api.nasa.gov, data.nasa.gov down or shutdown?

Why the Technological Singularity May Be a "Big Nothing"

Ask HN: Is Reddit going the way of Stack Overflow?

Ask HN: Significant reduction in AI related submissions?

Automated Workday check in/check out and Microsoft Teams messages monitoring

If AI agents take the jobs, who buys the stuff?

Ask HN: Why does Google word privacy settings like you agree even when off?

Ask HN: What do you think of the new Digg?

Ask HN: Moving from Dev to PM

Ask HN: Is your company still hiring junior engineers?

Ask HN: How long did it take you to learn Git?

New Member Alert

Ask HN: LLM struggles to center div too?

Tell HN: My advice after I applied to 450 positions before getting hired

Ask HN: Useful AI applications in regular businesses?

Ask HN: Why do LLMs struggle with word count?

A16Z scouting ambitious Swiss founders for $1M accelerator

ASIC: Proof-of-Concept Binary Optimizer Reduces Size, More to Come

Ask HN: How do you fight YouTube addiction and procrastination? I'm struggling

Ask HN: When was the last time you visited Stack Overflow?

Ask HN: VSCode AI Autocomplete Woes

File protection: anonymous, open source and fast

Will Amazon S3 Vectors Kill Vector Databases–Or Save Them?

Comments (27)