Been working on this AI search for a while now, pretty much as a one-man army. Had to handle everything: setting up K8s, buying GPUs, figuring out storage, ensuring reliability with replicas, tweaking the network... you name it. Even built a full-text search engine from scratch, added OCR for PDFs, and managed chunking, embedding, storing, assessing... A billion tiny steps that finally led to Space Frontiers.
Oh, and why’d I call it Space Frontiers? Ever since I was a kid, I’ve been obsessed with the idea of creating something like the Borg: you know, a super-intelligence linked to all of humanity’s digital knowledge. I dreamed it’d guide colony starships to other worlds, help create new life forms, and push consciousness out to the freaking edges of space. Yeah, that dream’s still a long way off, but I’m grinding to make it happen.
Right now, Space Frontiers search runs on just two servers, so it’s kinda slow. But for academic searches, covering things like standards, PubMed, Reddit, Telegram, patents, books, and so on, it’s just a query away. The quality of answers seems better than tools like Perplexity in Academic mode, Elicit, or Consensus. It’s not perfect, and sometimes it flops, but other times the results are so cool that it’s become a daily go-to alongside other search engines. Give it a try; hopefully, it’ll impress.
It’s a regular RAG setup with a large database, query reformulation, and expansion. The databases are split into BM25 and vector parts: the first is powered by my own search engine, Summa, and the vector part runs on AlloyDB. Embeddings are handled by Jina v3, while chunking and processing come from a ton of our boilerplate code. Candidates get reranked, then chunks are combined, the document set is expanded and reranked again, and finally everything is sent to LLMs with tuned prompts. That’s the gist of it. Oh, and Qwen3 is the king, by the way.
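To make that flow concrete, here’s a rough sketch of how a single query could travel through a setup like this. It’s not the actual Space Frontiers code: the endpoint URLs, table and column names, response shapes, and the llm/rerank callables are placeholders made up for illustration, assuming a Summa-style HTTP search API, a pgvector-style chunks table in AlloyDB, and Jina’s v3 embedding endpoint.

```python
# Rough sketch of a hybrid BM25 + vector RAG query path.
# All URLs, table/column names, and helper callables are hypothetical.
import requests

JINA_EMBED_URL = "https://api.jina.ai/v1/embeddings"  # assumed Jina embedding endpoint
SUMMA_SEARCH_URL = "http://summa.internal/search"      # hypothetical Summa HTTP endpoint


def reformulate(query: str, llm) -> list[str]:
    """Ask an LLM for a few search-friendly variants of the user query."""
    prompt = f"Rewrite the query '{query}' into 3 search-friendly variants, one per line."
    return [query] + llm(prompt).splitlines()


def bm25_search(query: str, k: int = 50) -> list[dict]:
    """Lexical candidates from the BM25 index (hypothetical Summa response shape)."""
    resp = requests.get(SUMMA_SEARCH_URL, params={"q": query, "limit": k})
    return resp.json()["hits"]  # assumed: [{"doc_id": ..., "text": ...}, ...]


def vector_search(query: str, conn, api_key: str, k: int = 50) -> list[dict]:
    """Semantic candidates from AlloyDB, assuming a pgvector-style 'chunks' table."""
    emb = requests.post(
        JINA_EMBED_URL,
        json={"model": "jina-embeddings-v3", "input": [query]},
        headers={"Authorization": f"Bearer {api_key}"},
    ).json()["data"][0]["embedding"]
    with conn.cursor() as cur:
        cur.execute(
            "SELECT doc_id, chunk_text FROM chunks ORDER BY embedding <=> %s::vector LIMIT %s",
            (str(emb), k),
        )
        return [{"doc_id": d, "text": t} for d, t in cur.fetchall()]


def merge_chunks_by_document(chunks: list[dict]) -> list[dict]:
    """Fold surviving chunks back into per-document passages (heavily simplified)."""
    by_doc: dict[str, list[str]] = {}
    for c in chunks:
        by_doc.setdefault(c["doc_id"], []).append(c["text"])
    return [{"doc_id": d, "text": "\n".join(t)} for d, t in by_doc.items()]


def answer(query: str, conn, api_key: str, llm, rerank) -> str:
    """Reformulate, retrieve from both indexes, rerank twice, then ask the LLM."""
    candidates = []
    for q in reformulate(query, llm):
        candidates += bm25_search(q) + vector_search(q, conn, api_key)
    top = rerank(query, candidates)[:30]        # first pass over raw chunks
    docs = merge_chunks_by_document(top)        # combine chunks, expand the document set
    context = "\n\n".join(d["text"] for d in rerank(query, docs)[:10])  # second pass
    return llm(f"Answer using only this context:\n{context}\n\nQuestion: {query}")
```

The real pipeline obviously does a lot more (dedup, prompt tuning, Qwen3 sitting behind the llm callable), but the reformulate → dual retrieval → rerank → merge/expand → rerank → generate shape is the part described above.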
Starting to see how AI paired with a fleet of knowledge sources should be organized, and there’s a ton left to tackle. That’s why I’m looking for a co-founder, pre-seed funding, and anything else needed to turn this dream into reality. If you’re interested, let’s talk.