Launch HN: Issen (YC F24) – Personal AI language tutor

70 points by mariano54 1h ago 48 comments

A Review of Aerospike Nozzles: Current Trends in Aerospace Applications (mdpi.com)

13 points by PaulHoule 49m ago 6 comments

Show HN: I built an AI dataset generator (github.com)

26 points by matthewhefferon 1h ago 5 comments

FLUX.1 Kontext [Dev] – Open Weights for Image Editing (bfl.ai)

27 points by minimaxir 51m ago 6 comments

A new pyramid-like shape always lands the same side up (quantamagazine.org)

541 points by robinhouston 20h ago 135 comments

I built an ADHD app with interactive coping tools, noise mixer and self-test (adhdhelp.app)

7 points by digitalions 2h ago 3 comments

Puerto Rico's Solar Microgrids Beat Blackout (spectrum.ieee.org)

256 points by ohjeez 16h ago 138 comments

Muvera: Making multi-vector retrieval as fast as single-vector search (research.google)

51 points by georgehill 5h ago 2 comments

-2000 Lines of code (folklore.org)

454 points by xeonmc 20h ago 187 comments

Learnings from building AI agents (cubic.dev)

99 points by pomarie 3h ago 31 comments

Snow - Classic Macintosh emulator (snowemu.com)

115 points by ColinWright 7h ago 43 comments

What makes comprehensible input comprehensible? (cij-analysis.streamlit.app)

12 points by surprisetalk 3d ago 1 comment

OpenAI charges by the minute, so speed up your audio (george.mand.is)

653 points by georgemandis 1d ago 194 comments

The Business of Betting on Catastrophe (thereader.mitpress.mit.edu)

9 points by anarbadalov 3d ago 0 comments

Real-world performance comparison of ebtree/cebtree/rbtree (wtarreau.blogspot.com)

29 points by misonic 2d ago 2 comments

Ambient Garden (ambient.garden)

182 points by fipar 2d ago 34 comments

Structured Output with LangChain and Llamafile (blog.brakmic.com)

23 points by brakmic 3d ago 12 comments

Modeling the World in 280 Characters (tympanus.net)

64 points by OuterVale 3d ago 9 comments

Writing a basic Linux device driver when you know nothing about Linux drivers (crescentro.se)

356 points by sbt567 4d ago 50 comments

Better Auth, by a self-taught Ethiopian dev, raises $5M from Peak XV, YC (techcrunch.com)

209 points by bundie 22h ago 153 comments

AccessOwl (YC S22) is hiring an Elixir Engineer to connect 100s of SaaS (ycombinator.com)

1 point by mathiasn 9h ago 0 comments

RSS Server Side Reader (matklad.github.io)

27 points by Bogdanp 4h ago 19 comments

LLM code generation may lead to an erosion of trust (jaysthoughts.com)

141 points by CoffeeOnWrite 10h ago 162 comments

Swift – Announcing the Android Workgroup (forums.swift.org)

3 points by desertmonad 2h ago 1 comment

What Problems to Solve (1966) (genius.cat-v.org)

423 points by jxmorris12 23h ago 52 comments

Build and Host AI-Powered Apps with Claude – No Deployment Needed (anthropic.com)

278 points by davidbarker 23h ago 115 comments

Apptainer: Application Containers for Linux (apptainer.org)

92 points by cl3misch 6h ago 58 comments

America’s incarceration rate is in decline (theatlantic.com)

218 points by paulpauper 23h ago 393 comments

Howdy – Windows Hello style facial authentication for Linux (github.com)

56 points by LorenDB 2d ago 28 comments

Define policy forbidding use of AI code generators (github.com)

463 points by todsacerdoti 16h ago 324 comments

MCP in LM Studio (lmstudio.ai)

216 points by yags 22h ago 120 comments

The Art of Hanakami, or Flower-Petal Folding (origamiusa.org)

58 points by s4074433 3d ago 2 comments

The Offline Club (theoffline-club.com)

171 points by esher 20h ago 88 comments

Gemini CLI (blog.google)

1332 points by sync 1d ago 729 comments

Getting by on the Generosity of Strangers in Japan (theworld.org)

68 points by ilamont 2d ago 28 comments

The first non-opioid painkiller (worksinprogress.news)

201 points by ortegaygasset 7h ago 152 comments

Bot or human? Creating an invisible Turing test for the internet (research.roundtable.ai)

126 points by timshell 1d ago 156 comments

I fought in Ukraine and here's why FPV drones kind of suck (warontherocks.com)

112 points by _tk_ 6h ago 163 comments

A new PNG spec (programmax.net)

611 points by bluedel 2d ago 554 comments

Web Embeddable Common Lisp (turtleware.eu)

136 points by todsacerdoti 1d ago 44 comments

Iroh: A library to establish direct connection between peers (github.com)

230 points by gasull 23h ago 51 comments

The symbol of earthly good, and the immediate object of toil (crookedtimber.org)

18 points by akkartik 3d ago 3 comments

Interstellar Flight: Perspectives and Patience (centauri-dreams.org)

91 points by JPLeRouzic 23h ago 157 comments

Libxml2's "no security embargoes" policy (lwn.net)

267 points by jwilk 20h ago 230 comments

Is Lovable getting monetization wrong? (getlago.substack.com)

123 points by FinnLobsien 1d ago 70 comments

RaptorCast: Designing a Messaging Layer (category.xyz)

43 points by wwolffrec 3d ago 15 comments

Microsoft Dependency Has Risks (blog.miloslavhomer.cz)

139 points by ArcHound 20h ago 194 comments

Games run faster on SteamOS than Windows 11, Ars testing finds (arstechnica.com)

372 points by _JamesA_ 20h ago 207 comments

'Sticky thinking' hampers decisions in depression (bps.org.uk)

102 points by domofutu 2d ago 57 comments

Getting ready to issue IP address certificates (community.letsencrypt.org)

313 points by Bogdanp 1d ago 170 comments

Muvera: Making multi-vector retrieval as fast as single-vector search

51 points by georgehill, 2 comments, 6/26/2025, 10:29:34 AM (research.google)

Comments (2)

dinobones · 5m ago

So this is basically an “embedding of embeddings”, an approximation of multiple embeddings compressed into one, to reduce dimensionality/increase performance.

All this tells me is that the “multiple embeddings” probably mostly overlap, and that the marginal value of each additional one is probably low if you can represent them all with a single embedding.

I don’t otherwise see how you can keep comparable performance without breaking information theory.

trengrj · 2h ago

We recently added Muvera to Weaviate (https://weaviate.io/blog/muvera) and also have a nice podcast on it (https://www.youtube.com/watch?v=nSW5g1H4zoU).

When looking at multi-vector / ColBERT-style approaches, the embedding-per-token approach can massively increase costs. You might go from a single 768-dimension vector to 128 × 130 = 16,640 dimensions. Even with better results from a multi-vector model, this can make it infeasible for many use cases.
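To put rough numbers on that cost, here is a back-of-the-envelope sketch. It assumes the 128 × 130 figure means 130 token vectors of 128 dimensions each, and that values are stored as 4-byte floats; both are assumptions for illustration, not something stated above.

```python
# Assumed reading of the figures: 130 token vectors, 128 dimensions each.
SINGLE_VECTOR_DIMS = 768
TOKEN_DIMS = 128
TOKENS_PER_DOC = 130
BYTES_PER_FLOAT32 = 4  # assumed storage format

multi_vector_dims = TOKEN_DIMS * TOKENS_PER_DOC
ratio = multi_vector_dims / SINGLE_VECTOR_DIMS
single_bytes = SINGLE_VECTOR_DIMS * BYTES_PER_FLOAT32
multi_bytes = multi_vector_dims * BYTES_PER_FLOAT32

print(multi_vector_dims)          # 16640 values per document
print(round(ratio, 1))            # 21.7 times the single-vector size
print(single_bytes, multi_bytes)  # 3072 66560 bytes per document
```

Under those assumptions, each document takes roughly 22 times the storage of a single 768-dimension vector before any quantization.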

Muvera converts the multiple vectors into a single fixed-dimension vector (usually smaller overall) that can be used by any ANN index. Because you now have a single vector, you can use all your existing ANN algorithms and stack other quantization techniques for memory savings. In my opinion it is a much better approach than PLAID because it doesn't require specific index structures or clustering assumptions and can achieve lower latency.
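As a rough illustration of how a variable-size set of vectors can become one fixed-size vector, here is a heavily simplified sketch in the spirit of Muvera's fixed dimensional encodings. It is not the Weaviate or Google implementation: it buckets vectors with random hyperplanes, keeps per-bucket sums for queries and per-bucket averages for documents, and leaves out the repetitions, extra projections and empty-bucket handling a real system would need. All function names here are hypothetical.

```python
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def chamfer_similarity(query_vecs, doc_vecs):
    """Multi-vector score: each query vector takes its best-matching doc vector."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)


def make_planes(num_planes, dim, seed=0):
    """Random Gaussian hyperplanes, shared by queries and documents."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_planes)]


def bucket_of(vec, planes):
    """SimHash-style bucket id: one sign bit per hyperplane."""
    bucket = 0
    for plane in planes:
        bucket = (bucket << 1) | (1 if dot(vec, plane) > 0 else 0)
    return bucket


def encode(vectors, planes, is_query):
    """Collapse a set of vectors into one vector of length 2**len(planes) * dim."""
    dim = len(vectors[0])
    num_buckets = 2 ** len(planes)
    sums = [[0.0] * dim for _ in range(num_buckets)]
    counts = [0] * num_buckets
    for vec in vectors:
        b = bucket_of(vec, planes)
        counts[b] += 1
        for i, value in enumerate(vec):
            sums[b][i] += value
    encoded = []
    for b in range(num_buckets):
        # Queries keep per-bucket sums; documents use per-bucket averages.
        scale = 1.0 if is_query or counts[b] == 0 else 1.0 / counts[b]
        encoded.extend(value * scale for value in sums[b])
    return encoded


planes = make_planes(num_planes=3, dim=4)
rng = random.Random(1)
query = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(5)]
document = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(9)]

query_fde = encode(query, planes, is_query=True)
doc_fde = encode(document, planes, is_query=False)

print(len(query_fde), len(doc_fde))  # both 2**3 * 4 = 32, regardless of set size
# Single-vector inner product next to the exact multi-vector score it stands in for.
print(dot(query_fde, doc_fde), chamfer_similarity(query, document))
```

The point of the sketch is that the output length depends only on the number of hyperplanes and the token dimension, not on how many vectors went in, which is what lets an ordinary single-vector ANN index store and search the result.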