Ask HN: Anyone using a Linux machine for local inference?

2 points by throwaw12 on 7/29/2025, 11:11:16 AM · 2 comments
Hey there,

Is anyone here using a Linux machine with 256GB or 512GB of RAM to run the latest models locally?

I am considering buying a new laptop/desktop to run models locally. Most benchmarks I see are for Apple M-series chips with MLX, and even then, for big models (>300B parameters), people are using quantized versions (3-bit, 4-bit), which causes a drop in quality.

If anyone has used Linux with >256GB of RAM and no dedicated GPU, how has your experience been?
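For context, the kind of setup I'm picturing is a plain CPU-only run with llama-cpp-python, something like the sketch below (the model file and settings are hypothetical placeholders, not a recommendation):

    # CPU-only inference sketch using llama-cpp-python (pip install llama-cpp-python).
    # The GGUF path and parameters are placeholders; a 4-bit quant of a ~300B-parameter
    # model needs enough RAM to hold the whole file.
    from llama_cpp import Llama

    llm = Llama(
        model_path="./models/big-model-Q4_K_M.gguf",  # hypothetical file name
        n_ctx=8192,       # context window
        n_threads=32,     # roughly match physical cores
        n_gpu_layers=0,   # no dedicated GPU, everything stays in system RAM
    )

    out = llm.create_chat_completion(
        messages=[{"role": "user", "content": "Hello, are you running entirely in RAM?"}],
        max_tokens=128,
    )
    print(out["choices"][0]["message"]["content"])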

Comments (2)

compressedgas · 11h ago
Running LLMs on CPU only is too slow.
incomingpain · 11h ago
I've tried this with DeepSeek R1; I got about 2 tokens/second, and each response took 10-15 minutes.

The hardware was free to me, but building this yourself would cost thousands. You might as well just hit up an API: https://openrouter.ai/deepseek/deepseek-r1-0528/providers

Even if you hammer it, it'll only be $10.
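A minimal sketch of that, assuming the usual OpenAI-compatible OpenRouter endpoint and an API key in an environment variable:

    # Calling DeepSeek R1 on OpenRouter instead of buying hardware.
    # Assumes `pip install openai` and an OpenRouter key in OPENROUTER_API_KEY.
    import os
    from openai import OpenAI

    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key=os.environ["OPENROUTER_API_KEY"],
    )

    resp = client.chat.completions.create(
        model="deepseek/deepseek-r1-0528",
        messages=[{"role": "user", "content": "Summarize this function for me..."}],
    )
    print(resp.choices[0].message.content)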

>Most benchmarks I see are for Apple M-series chips with MLX

A Mac mini with a Pro chip and 64GB of RAM is actually suspiciously good value. Something like $4,000... a bit high, but it can be your workstation.

The GPU and system memory are unified, so you can load up bigger models. It's not the same speed as high-end GPUs, but it's also not the same power draw; you'll stay under 200 watts.

Obviously 64GB doesn't let you run full DeepSeek or similar either, but those 32B-70B models are ideal anyway.

At a somewhat cheaper price, there are mini PCs with the AMD Ryzen AI Max+ 395. Same idea as the Mac mini, and you can get 64-128GB of RAM. Intel has a similar chip.

You'll get 15-20 tokens/s from a 32B model, which is slow if you're coding.
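That figure lines up with a rough bandwidth estimate (the bandwidth and efficiency numbers below are assumptions, not vendor specs):

    # Token generation on CPU/iGPU is mostly memory-bandwidth bound:
    # tokens/s ~= usable bandwidth / bytes read per token (~model size for a dense model).
    params = 32e9              # 32B dense model
    bytes_per_weight = 0.5     # ~4-bit quantization
    model_bytes = params * bytes_per_weight   # ~16 GB of weights read per token
    bandwidth = 256e9          # assumed ~256 GB/s unified LPDDR5X bandwidth
    efficiency = 0.7           # you rarely hit the theoretical peak

    print(f"~{bandwidth * efficiency / model_bytes:.0f} tokens/s")  # ~11, same ballpark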

Now, you could look into high-end GPUs: get a server mobo with 10 PCIe slots and load it up with 16GB cards for 160GB of VRAM. But you'll need special electrical plugs, and it'll idle at around 600 watts, costing about $100/month in electricity. But man, that thing would be great, so fast.
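The $100/month figure is just idle power times an assumed electricity rate:

    # Monthly cost of a box idling at ~600 W; the $/kWh rate is an assumption.
    idle_watts = 600
    kwh_per_month = idle_watts * 24 * 30 / 1000      # 432 kWh
    rate_usd_per_kwh = 0.23                          # plug in your local rate
    print(f"~${kwh_per_month * rate_usd_per_kwh:.0f}/month")  # ~$99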