Ask HN: Setup for Local LLM Backups?
As a backup in case future access changes, I'd like to have SOTA LLM weights, and a machine that can run queries with them, in reserve. OpenAI releasing open weights is as good a time as any to actually do it. My question is: what hardware setup would you buy that is reasonably accessible (say under $5k, ideally well under) and can do a good job running these models for local queries? And, if it matters, it should be suitable for archiving, say until either there is a substantial advance rendering today's LLMs obsolete, or until it's needed because good open weights are no longer available.
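
For sizing, a rough back-of-envelope sketch (my own arithmetic, not authoritative): weight memory is roughly parameter count times bytes per weight, plus runtime overhead. The 20% fudge factor is an assumption and ignores KV-cache growth with context length; parameter counts are approximate.

    # Back-of-envelope VRAM/RAM estimate for a dense LLM's weights.
    # Ignores KV cache and runtime buffers beyond a flat fudge factor.
    def vram_gb(params_b: float, bits_per_weight: float, overhead: float = 1.2) -> float:
        """params_b: parameters in billions; bits_per_weight: 16 (fp16), 4 (4-bit quant)."""
        weight_bytes = params_b * 1e9 * bits_per_weight / 8
        return weight_bytes * overhead / 1e9

    for name, params in [("gpt-oss-20b", 20), ("gpt-oss-120b", 120), ("70B dense", 70)]:
        print(f"{name}: fp16 ~{vram_gb(params, 16):.0f} GB, 4-bit ~{vram_gb(params, 4):.0f} GB")

By this math a 4-bit quant of a ~120B model wants ~70 GB of fast memory, which under $5k mostly points at high-unified-memory Apple silicon or a multi-GPU rig, while a ~20B model at 4-bit fits on a single 16-24 GB consumer card.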
The real issue is having SOTA, collapse-proof hardware for inference. Apple and Nvidia hardware both rely on proprietary drivers that an over-the-air update can brick. AMD hardware has the generally more resilient open-source Mesa drivers, which could in theory survive a hostile OEM, but offers fewer options for finetuning and training. Intel GPUs are a high-VRAM option, but it's unclear how long their software support will last. Everything is a system of tradeoffs.
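
One hedge against the driver problem: llama.cpp-style CPU inference needs no vendor GPU driver at all, just a CPU and RAM. A minimal sketch using llama-cpp-python; the archived model path and quant choice are hypothetical, the point is that nothing here touches a GPU stack.

    # CPU-only inference: no CUDA, Metal, or vendor driver involved.
    # Assumes a GGUF quantization was archived alongside the raw weights.
    from llama_cpp import Llama

    llm = Llama(model_path="/archive/gpt-oss-20b-Q4_K_M.gguf", n_ctx=4096)
    out = llm("Q: What year did Apollo 11 land on the moon? A:", max_tokens=32)
    print(out["choices"][0]["text"])

Slow for big models, but of all the options it is the one most likely to still run on whatever commodity hardware survives.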