Show HN: OctaneDB – Fast, Open-Source Vector Database for Python

24 rijin_r 6 8/23/2025, 6:31:00 AM github.com ↗
OctaneDB is an open-source vector database for Python that focuses on ultra-fast similarity search for high-dimensional data—perfect for AI/ML, semantic search, and large-scale document or embedding retrieval.

What does it do?

Store, index, and search millions of embeddings (text, images, etc.) with sub-millisecond query time.

Supports in-memory and efficient HDF5 persistent storage.

Integrates seamlessly with sentence-transformers for automatic text embedding.

Key Features:

10x faster than Pinecone or ChromaDB for vector search and batch insertions.

Advanced indexing: HNSW (approximate nearest neighbor), FlatIndex

Batch search, advanced metadata filtering, GPU acceleration

Comments (6)

icemanx · 11m ago
Definitely vibe code, and I agree with the others - please mention it in the README if you want people to use AI generated code. You are not the mind behind it

Update: Looking through your code, I already found within 5 minute flaws, since it is obvious that you have not written it (looking at your past work, I doubt you even understand what it does), I will not even point the issues out.

ozgrakkurt · 1h ago
Please stop with these LLM generated readme files. It burns the eyes.

Just take 10 minutes to write something that explains the project.

As soon as I see this README I assume the code is generated(worthless) as well

ImPrajyoth · 1h ago
Someone wise once said

- Don't judge a book by it's cover

mfitzp · 55m ago
Someone even wiser once said:

- Don’t give your book a shit cover because it will put people off.

Bishonen88 · 1h ago
Looking at the repo, everything seems too clean and pristine. There's only 5 commits. Not that it matters, but was the whole thing vibe coded? If so, I wonder if mentioning this in the Readme would be helpful to potential users.
JSR_FDED · 1h ago
This looks very good, easy to understand. Do you have a sense for how much RAM it uses (not for storing the vectors themselves - those I’ll keep in a file), but when doing a search? I have one use case where it could run on a VPS provided its memory use doesn’t balloon too much.