Show HN: I built a (legit) AI mortgage document analyzer that saves you money (oxford.loan-estimate-analysis.morfi.com)

I'm constantly tempted by the idealism of this experience, but when you factor in the performance of the models you have access to, and the cost of running them on-demand in a cloud, it's really just a fun hobby instead of a viable strategy to benefit your life.

As the hardware continues to iterate at a rapid pace, anything you pick up second-hand will still deprecate at that pace, making any real investment in hardware unjustifiable.

Coupled with the dramatically inferior performance of the weights you would be running in a local environment, it's just not worth it.

I expect this will change in the future, and am excited to invest in a local inference stack when the weights become available. Until then, you're idling a relatively expensive, rapidly depreciating asset.

jeremyjh · 5m ago

I expect it will never change. In two years if there is a local option as good as GPT-5 there will be a much better cloud option and you'll have the same tradeoffs to make.

braooo · 15m ago

Running LLMs at home is a repeat of the mess we make with "run a K8s cluster at home" thinking

You're not OpenAI or Google. Just use pytorch, opencv, etc to build the small models you need.

You don't need Docker even! You can share over a simple code based HTTP router app and pre-shared certs with friends.

You're recreating the patterns required to manage a massive data center in 2-3 computers in your closet. That's insane.

shaky · 46m ago

This is something that I think about quite a bit and am grateful for this write-up. The amount of friction to get privacy today is astounding.

sneak · 9m ago

This writeup has nothing of the sort and is not helpful toward that goal.

mark_l_watson · 9m ago

That is fairly cool. I was talking about this on X yesterday: another angle however, I use a local web scraper and search engine via meilisearch the main tech web sites I am interested in. For my personal research I use three web search APIs, but there is some latency. Having a big chuck of the web that I am interested in available locally with close to zero latency is nice when running local models, my own MCP services that might need web search, etc.

noelwelsh · 41m ago

It's the hardware more than the software that is the limiting factor at the moment, no? Hardware to run a good LLM locally starts around $2000 (e.g. Strix Halo / AI Max 395) I think a few Strix Halo iterations will make it considerably easier.

colecut · 38m ago

This is rapidly improving

https://simonwillison.net/2025/Jul/29/space-invaders/

ramesh31 · 32m ago

>Hardware to run a good LLM locally starts around $2000 (e.g. Strix Halo / AI Max 395) I think a few Strix Halo iterations will make it considerably easier.

And "good" is still questionable. The thing that makes this stuff useful is when it works instantly like magic. Once you find yourself fiddling around with subpar results at slower speeds, essentially all of the value is gone. Local models have come a long way but there is still nothing even close to Claude levels when it comes to coding. I just tried taking the latest Qwen and GLM models for a spin through OpenRouter with Cline recently and they feel roughly on par with Claude 3.0. Benchmarks are one thing, but reality is a completely different story.

ahmedbaracat · 40m ago

Thanks for sharing. Note that the GitHub at the end of the article is not working…

mkagenius · 32m ago

Thanks for the heads up. Its fixed now -

Coderunner-UI: https://github.com/instavm/coderunner-ui

Coderunner: https://github.com/instavm/coderunner

navbaker · 36m ago

Open Web UI is a great alternative for a chat interface. You can point to an OpenAI API like vLLM or use the native Ollama integration and it has cool features like being able to say something like “generate code for an HTML and JavaScript pong game” and have it display the running code inline with the chat for testing

sneak · 9m ago

Halfway through he gives up and uses remote models. The basic premise here is false.

Also, the term “remote code execution” in the beginning is misused. Ironically, remote code execution refers to execution of code locally - by a remote attacker. Claude Code does in fact have that, but I’m not sure if that’s what they’re referring to.

pyman · 24m ago

Mr Stallman? Richard, is that you?

dmezzetti · 33m ago

I built TxtAI with this philosophy in mind: https://github.com/neuml/txtai

Show HN: Trayce – Burp Suite for developers (trayce.dev)

Show HN: Synchrotron, a real-time DSP engine in pure Python (synchrotron.thatother.dev)

Show HN: Aha Domain Search (ahadomainsearch.com)

Show HN: Selfhostllm.org – Plan GPU capacity for self-hosting LLMs (selfhostllm.org)

Show HN: BaaS to build agents as data, not code (github.com)

Show HN: Aegis – A framework for AI-governed software development (github.com)

Show HN: Kitten TTS – 25MB CPU-Only, Open-Source TTS Model (github.com)

Show HN: GPT-5 Document Retrieval – AI Assistant with Inline Citations (smartresearch-ai.com)

Show HN: Bringing Tech News from HN to My Community (sh4jid.me)

Show HN: Browser AI agent platform designed for reliability (github.com)

Show HN: LLM from URL –– A free AI chat completion service directly from URL (818233.xyz)

Show HN: Octofriend, a cute coding agent that can swap between GPT-5 and Claude (github.com)

Show HN: I built a (legit) AI mortgage document analyzer that saves you money (oxford.loan-estimate-analysis.morfi.com)

Show HN: I built a tool that lets you summon AI in any app or website (useinset.com)

Show HN: I built a free fast-paced match-3 game I think you'll like (sqrz.app)

Show HN: Stasher – Burn-after-read secrets from the CLI, no server, no trust (github.com)

Show HN: Starter Repo for Rust CLI with built in database (deebkit.com)

Show HN: Wordle" with the first 1000 Digits of Pi (subjectle.com)

Show HN: I completed mobile fax app in 60 hours

Show HN: Sinkzone DNS – Forwarder that blocks everything except your allowlist (github.com)

Show HN: Aura – Like robots.txt, but for AI actions (github.com)

Show HN: Fiig – Reimagining Google scholar for AI research and PDF annotation (app.ubik.studio)

Show HN: I spent 6 years building a ridiculous wooden pixel display (benholmen.com)

Show HN: I've been building an ERP for manufacturing for the last 3 years (github.com)

Show HN: Whittle – A shrinking word game (playwhittle.com)

Show HN: An open-source e-book reader for conversational reading with an LLM (github.com)

Show HN: When is the next Caltrain? (minimal webapp) (erikschluntz.com)

Show HN: I built a simple tool to automate data into Google Sheets and BigQury (syncrange.com)

Show HN: HMPL – Small Template Language for Rendering UI from Server to Client (github.com)

Show HN: A light GPT-5 vs. Claude Code comparison (charlielabs.ai)

Show HN: Sidequest.js – Background jobs for Node.js using your database (docs.sidequestjs.com)

Show HN: Tiny logic and number games I built for my kids (quizmathgenius.com)

Show HN: FFlags – Feature flags as code, served from the edge (fflags.com)

Show HN: Mathpad – Physical keypad for typing math symbols (crowdsupply.com)

Show HN: Write lead sheets in a Markdown way and transpose in a second (cord.land)

Show HN: Kimu – Open-Source Video Editor (trykimu.com)

Show HN: Stagewise (YC S25) – Front end coding agent for existing codebases (github.com)

Show HN: WebGPU enables local LLM in the browser – demo site with AI chat (andreinwald.github.io)

Show HN: I built an ARG (Alternate Reality Game) for some friends (rishabhegde.substack.com)

Show HN: Student attempt at proving P ≠ NP using geometry and lattices (zenodo.org)

Show HN: My Resume Is a Gameboy (samuel.computer)

Show HN: Creating a Binary Puzzle Game (taengo.vercel.app)

Show HN: FocusTree – a simple task app (prototype), free open source (github.com)

Show HN: ChaCha12-BLAKE3: Secure and Fast committing AEAD Encryption for Any CPU (github.com)

Show HN: Hector Analytics – GDPR-first web analytics with <1KB script (hectoranalytics.com)

Show HN: Rust framework for advanced file recognition and identification (crates.io)

Show HN: Grow a Garden Cooking Recipes (growagardencookingrecipes.pro)

Show HN: I built a tool to replace capcut audio transcription (meetcosmos.com)

Show HN: Mcp-use – Connect any LLM to any MCP (github.com)

Show HN: From Hacking a T480 to the Fastest Open-Hardware 75 Hz E-Ink Display (crowdsupply.com)

I want everything local – Building my offline AI workspace

Comments (15)