
This is a good article and seems well balanced despite being written by someone with a product that directly competes with Amazon S3. I particularly appreciated their attempt to reverse-engineer how S3 Vectors works, including this detail:

> Filtering looks to be applied after coarse retrieval. That keeps the index unified and simple, but it struggles with complex conditions. In our tests, when we deleted 50% of data, TopK queries requesting 20 results returned only 15—classic signs of a post-filter pipeline.

Things like this are why I'd much prefer if Amazon provided detailed documentation of how their stuff works, rather than leaving it to the development community to poke around and derive those details independently.
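To make the quoted "post-filter pipeline" point concrete, here is a toy sketch of the general technique. It is not Amazon's implementation, which is undocumented; the function names and the one-dimensional data are made up for illustration. A post-filter retrieves the K nearest candidates first and only then drops excluded rows, so it can return fewer than K results. A pre-filter excludes rows before ranking and can still fill all K slots.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def post_filter_top_k(query, vectors, deleted, k):
    """Rank everything, keep the k nearest, THEN drop deleted ids."""
    ranked = sorted(vectors, key=lambda i: squared_distance(vectors[i], query))
    candidates = ranked[:k]
    # Dropped candidates are not backfilled, so the result may be shorter than k.
    return [i for i in candidates if i not in deleted]


def pre_filter_top_k(query, vectors, deleted, k):
    """Drop deleted ids first, then rank and keep the k nearest survivors."""
    live = [i for i in vectors if i not in deleted]
    ranked = sorted(live, key=lambda i: squared_distance(vectors[i], query))
    return ranked[:k]


# Hypothetical data: 100 one-dimensional vectors, with every even id deleted (50%).
vectors = {i: [float(i)] for i in range(100)}
deleted = {i for i in vectors if i % 2 == 0}
query = [0.0]

short = post_filter_top_k(query, vectors, deleted, k=20)
full = pre_filter_top_k(query, vectors, deleted, k=20)
print(len(short), len(full))
```

In this toy exactly half of the 20 candidates are deleted, so the post-filter returns 10 results while the pre-filter returns 20. It does not try to reproduce the article's figure of 15; it only shows why a shortfall points to filtering after retrieval.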

qaq · 13m ago

"I recently spoke with the CTO of a popular AI note-taking app who told me something surprising: they spend twice as much on vector search as they do on OpenAI API calls. Think about that for a second. Running the retrieval layer costs them more than paying for the LLM itself. That flips the usual assumption on its head." Hmm, well, start sending full documents as part of the context and see it flip back :).

scosman · 9m ago

Anyone interested in this space should look at https://turbopuffer.com - I think they were first to market with S3 backed vector storage, and a good memory cache in front of it.

resters · 40m ago

By hosting the vectors themselves, AWS can meta-optimize its cloud based on content characteristics. It may seem like not a major optimization, but at AWS scale it is billions of dollars per year. It also makes it easier for AWS to comply with censorship requirements.

barbazoo · 26m ago

> It also makes it easier for AWS to comply with censorship requirements.

Does it? How? Why would it be the vector store that makes it easier for them to censor the content? Why not censor the documents in S3 directly, or the entries in the relational database? What is different about censoring those vs. a vector store?

resters · 17m ago

Once a vector has been generated (and someone has paid for it) it can be searched for and relevant content can be identified without AWS incurring any additional cost to create its own separate censorship-oriented index, etc. AWS can also add additional bits to the vector that benefit its internal goals (scalability, censorship, etc.)

Not to mention there is lock-in once you've gone to the trouble of using a specific embedding model on a bunch of content. Ideally we'd converge on backwards-compatible, open source approaches, but cloud vendors want to offer "value" by offering "better" embedding models that are not open source.

simonw · 10m ago

Why would they do that? Doesn't sound like something that would attract further paying customers.

Are there laws on the books that would force them to apply the technology in this way?

resters · 2m ago

Not official laws that we can read, but things like that are already in place per the Snowden revelations.

barbazoo · 14m ago

And that doesn't apply to any other database/search technology AWS offers?

resters · 3m ago

It does to some but not to most of it, which is why Azure and GCP offer nearly the exact same core services.

storus · 29m ago

Does this support hybrid search (dense + sparse embeddings)? Pure dense embeddings aren't that great for specific search; they only hit meaning reliably. Amazon's own embeddings also aren't SOTA.
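For readers unfamiliar with the term, hybrid search usually means running a dense (embedding) query and a sparse (keyword) query separately and then merging the two ranked lists. One common merging method is reciprocal rank fusion, sketched below. This is a generic illustration only: nothing in the thread says S3 Vectors or OpenSearch merges results this way, and the document ids and the constant `k=60` are assumptions for the example.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of ids into one list.

    Each id scores 1 / (k + rank) for every list it appears in, so ids
    that rank well in both the dense and the sparse list rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken by id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical result lists from a dense index and a keyword index.
dense_hits = ["a", "b", "c"]
sparse_hits = ["c", "a", "d"]

print(reciprocal_rank_fusion([dense_hits, sparse_hits]))
# → ['a', 'c', 'b', 'd']
```

Documents "a" and "c" appear in both lists and therefore outrank "b" and "d", which each appear in only one.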

infecto · 28m ago

That’s where my mind was going too. And if not, can this be used in OpenSearch hybrid search?

Fendy · 1h ago

what do you think?

sharemywin · 1h ago

it's annoying to me that there's not a doc store with vectors. seems like the vector dbs just store the vectors.

whakim · 9m ago

Elasticsearch and Vespa both fit the bill for this, if your scale grows beyond the purpose-built vector stores.

simonw · 9m ago

Elasticsearch and MongoDB Atlas and PostgreSQL and SQLite all have vector indexes these days.

storus · 23m ago

Pinecone allows 40KB of metadata with each vector, which is often enough.

intalentive · 47m ago

I just use sqlite
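As a rough illustration of keeping documents and vectors in one place with SQLite, here is a minimal sketch using only Python's standard library. It is an assumption about one possible setup, not a description of how the commenter uses SQLite: the table layout, helper names, and two-dimensional toy embeddings are all invented, and the search is a brute-force scan rather than a real vector index.

```python
import math
import sqlite3
import struct


def pack(vec):
    """Serialize a list of floats into a little-endian float32 blob."""
    return struct.pack(f"<{len(vec)}f", *vec)


def unpack(blob):
    """Inverse of pack(): decode a float32 blob back into a list."""
    return list(struct.unpack(f"<{len(blob) // 4}f", blob))


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(conn, query, k=3):
    """Brute-force scan: score every row and return the k best (id, body)."""
    rows = conn.execute("SELECT id, body, embedding FROM docs").fetchall()
    scored = [(cosine(query, unpack(blob)), doc_id, body)
              for doc_id, body, blob in rows]
    scored.sort(key=lambda t: (-t[0], t[1]))
    return [(doc_id, body) for _, doc_id, body in scored[:k]]


# The document text and its vector live in the same row.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE docs (id INTEGER PRIMARY KEY, body TEXT, embedding BLOB)"
)
docs = [
    ("cats purr", [1.0, 0.0]),
    ("dogs bark", [0.0, 1.0]),
    ("kittens mew", [0.9, 0.1]),
]
conn.executemany(
    "INSERT INTO docs (body, embedding) VALUES (?, ?)",
    [(body, pack(vec)) for body, vec in docs],
)

print(search(conn, [1.0, 0.0], k=2))
# → [(1, 'cats purr'), (3, 'kittens mew')]
```

A full scan like this is only reasonable for small collections; larger ones would need an actual vector index.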

jeffchuber · 51m ago

chroma stores both

nkozyra · 48m ago

As does Azure's AI search.


Will Amazon S3 Vectors Kill Vector Databases–Or Save Them?

Comments (20)
