As the author of a vector search engine, I was low-key excited for this: there is no good benchmark for vector search out there that resembles real-world use even a little, and all the vendors have their own internal stuff. But I think the term "bench" is a misnomer here. It's really more of a pgvector demo app, and I don't think you can usefully benchmark anything with it, at least not out of the box.

tanelpoder · 9h ago

Yeah, I just wanted a cool-sounding name for this. Nevertheless, it allows you to do easy stress-testing with some vector search operations (a quite narrow set, but you can combine it with joins and write your own queries if you like). But "CatStress" didn't sound too good to me.

It's a "Vector Search Playground" really, but the bigger value so far has come not from running maximum stress tests, but from showing people how you can join vector search results to the rest of your (existing) application schema. Plenty of people have thought that you need a completely separate, isolated vectorstore behind some API for this...
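To make the "join vector search results to your existing schema" point concrete, here is a minimal sketch of what such a query could look like with pgvector. The `photos` and `customers` tables, their columns, and the helper names are hypothetical and not taken from the demo app; the only pgvector-specific parts are the `<=>` cosine-distance operator and the `[x,y,...]` text form for vectors.

```python
# Hypothetical schema (an assumption, not the demo app's actual tables):
#   photos(id, owner_id, embedding vector(...))
#   customers(id, name)

def build_similar_photos_query(k: int = 10) -> str:
    """Return SQL that finds the k nearest photos and joins them to customers.

    The %(query_vec)s placeholder uses the named-parameter style accepted by
    psycopg; pass the vector as a text literal such as '[0.1,0.2,0.3]'.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    return f"""
        SELECT p.id,
               c.name,
               p.embedding <=> %(query_vec)s::vector AS distance
        FROM photos AS p
        JOIN customers AS c ON c.id = p.owner_id
        ORDER BY p.embedding <=> %(query_vec)s::vector
        LIMIT {int(k)}
    """


def to_vector_literal(values) -> str:
    """Format a sequence of numbers as a pgvector text literal."""
    return "[" + ",".join(repr(float(v)) for v in values) + "]"


sql = build_similar_photos_query(k=5)
params = {"query_vec": to_vector_literal([0.12, 0.5, 0.33])}
```

The point of the sketch is only that the nearest-neighbor ordering and an ordinary relational join can live in one statement, so the embeddings do not have to sit in a separate store.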

Edit: Also the setup part includes running a "generate_embeddings.py" script that uses PyTorch under the hood (on CPUs or CUDA/GPUs) to generate embeddings from the 25k photos (or 9M when using the rotated variants). That process can also be sped up and optimized for sure - my whole point is that once everything runs OK enough from end to end, then it's time to start measuring and optimizing the whole process - for learning and fun.

binarymax · 9h ago

https://ann-benchmarks.com is pretty good but I agree it needs an update. I'd like to see modern embedding dimensions (384, 768, 1536, etc.) as well as filters and combined read/write latencies.

jbellis · 9h ago

modern dimensions, yes

mixed workloads, also yes, especially in an "online" environment rather than the "batch mode" that ann-benchmarks does today

but most importantly, multicore -- ann-benchmarks is limited to a single-core Docker image, which is absolutely ludicrous, and I suspect it is a significant reason that Python-based systems do much better in their benchmark than you would expect from trying to deploy them under concurrent load
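As a rough illustration of the "online, concurrent" measurement being asked for, here is a small sketch of a client-side load driver that issues queries from several threads and reports throughput and latency percentiles. This is not how ann-benchmarks works; `run_load` and `fake_search` are made-up names, and `fake_search` is just a sleep standing in for a call to whatever search system is under test. Threads are enough here on the assumption that the client mostly waits on network I/O.

```python
import statistics
import time
from concurrent.futures import ThreadPoolExecutor


def run_load(query_fn, queries, workers=4):
    """Run query_fn over queries from `workers` threads and summarize timings."""

    def timed(q):
        start = time.perf_counter()
        query_fn(q)
        return time.perf_counter() - start

    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        latencies = sorted(pool.map(timed, queries))
    wall = time.perf_counter() - wall_start

    if not latencies:
        raise ValueError("no queries were run")

    # Nearest-rank style index for the 99th percentile, clamped to the last item.
    p99_index = min(len(latencies) - 1, int(0.99 * len(latencies)))
    return {
        "count": len(latencies),
        "qps": len(latencies) / wall,
        "p50_s": statistics.median(latencies),
        "p99_s": latencies[p99_index],
    }


def fake_search(q):
    # Stand-in for a real query against the system under test (assumption).
    time.sleep(0.001)


stats = run_load(fake_search, range(200), workers=8)
print(stats)
```

Sweeping `workers` upward and watching where `qps` flattens while `p99_s` climbs is one simple way to see how a system behaves under concurrent load rather than in single-threaded batch mode.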

binarymax · 9h ago

Indeed! I'm just looking at JVector which I wasn't familiar with - looks cool. Have you tried it with the billion-scale competition? (not sure if that's still running)

jbellis · 8h ago

sort of, there was the original bigann and then they followed up with a couple more specialized contests the following year; I think it's over now

~300M modern-sized vectors is pretty close to jvector's limit in a single index (the Cassandra layer can shard more) https://foojay.io/today/indexing-all-of-wikipedia-on-a-lapto...

that said I think Mariano (new jvector maintainer) is working on ways to handle larger datasets in a single index but I'm not sure where that is on his priority list

Catbench Vector Search Demo Has Postgres SQL Throughput, Latency Monitoring Now (tanelpoder.com)

Comments (6)