Ask HN: Which laptop can run the largest LLM?

3 points by grokblah · 8/14/2025, 4:30:31 PM · 2 comments
I’d like to experiment with LLMs locally and understand their infrastructure better.

Comments (2)

PaulHoule · 2h ago
Don’t the M-series processors in MacBook Pros have a huge amount of high-bandwidth unified memory (LPDDR, not true HBM), which is good for models? I see you can get a Pro with 48GB of unified memory, whereas Alienware will sell you a machine with 32GB of regular RAM plus 24GB of graphics RAM on a discrete 5090 GPU. So the Mac has twice the RAM accessible to the GPU.
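
(Back-of-the-envelope: a quantized model's weights take roughly parameter count times bits per weight, divided by 8, in bytes. A minimal sketch in Python; the quantization width and model sizes are illustrative assumptions, not exact figures:)

    def weight_footprint_gb(params_b: float, bits_per_weight: float) -> float:
        """Approximate weight size in GB: parameters (billions) * bits per weight / 8."""
        return params_b * bits_per_weight / 8

    # Compare a 48GB unified-memory MacBook against a 24GB discrete GPU.
    for label, params_b in [("70B", 70), ("30B", 30), ("13B", 13)]:
        gb = weight_footprint_gb(params_b, 4.5)  # Q4_K_M quants average roughly 4.5 bits/weight
        print(f"{label} @ ~Q4: {gb:.0f}GB  fits in 48GB: {gb < 48}  fits in 24GB: {gb < 24}")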
incomingpain · 1h ago
https://rog.asus.com/us/laptops/rog-flow/rog-flow-z13-2025/s...

The out-of-stock one has 128GB of unified system RAM, on the AMD Ryzen AI Max+ 395 chip.
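
(For reference, loading a local GGUF model on a machine like this is only a few lines via the llama-cpp-python bindings; the model path and settings below are placeholders, not a tested config for this laptop:)

    from llama_cpp import Llama  # pip install llama-cpp-python

    llm = Llama(
        model_path="models/some-model-q4_k_m.gguf",  # placeholder: any local GGUF file
        n_gpu_layers=-1,  # offload all layers to the GPU backend
        n_ctx=8192,       # context window; 128GB of unified memory leaves lots of headroom
    )

    out = llm("Briefly explain unified memory.", max_tokens=64)
    print(out["choices"][0]["text"])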

So a 70B model fits easily in that much VRAM, but it will be slow: dense-model decode is memory-bandwidth-bound, and at this chip's ~256 GB/s a Q4 70B tops out around 6 tokens/s in theory (rough math sketched below, after the model list). MoE models fare much better:

Qwen3 30B-A3B will be in the 60 tokens/s range.

Llama 4 Scout will be around 20-30 tokens/s.
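
(Rough math behind throughput numbers like these: single-stream decode is mostly memory-bandwidth-bound, so a ceiling is bandwidth divided by the bytes of weights read per token, and for MoE models only the active parameters count. A sketch using the chip's quoted ~256 GB/s LPDDR5X bandwidth; the active-parameter counts and quantization width are assumptions:)

    def decode_ceiling_tps(bandwidth_gb_s: float, active_params_b: float, bits: float) -> float:
        """Theoretical tokens/s ceiling: bandwidth over GB of weights streamed per token."""
        return bandwidth_gb_s / (active_params_b * bits / 8)

    BW = 256.0  # ~GB/s for 256-bit LPDDR5X-8000 (Ryzen AI Max+ 395 class)
    for label, active_b in [("dense 70B", 70),
                            ("Qwen3 30B-A3B (~3B active)", 3),
                            ("Llama 4 Scout (~17B active)", 17)]:
        print(f"{label}: <= {decode_ceiling_tps(BW, active_b, 4.5):.0f} tok/s")

Real-world throughput lands below these ceilings, but they show why the MoE models above run so much faster than a dense 70B on the same hardware.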