Show HN: Local LLM Notepad – run a GPT-style model from a USB stick

Posted by davidye324 on 6/30/2025, 11:43:37 PM (github.com)
What it is

A single 45 MB Windows .exe that embeds llama.cpp and a minimal Tk UI. Copy it (plus any .gguf model) to a flash drive, double-click it on any Windows PC, and you’re chatting with an LLM. No admin rights, cloud services, or network access required.

Why I built it

Existing “local LLM” GUIs assume you can pip install, pass long CLI flags, or download GBs of extras.

I wanted something my less-technical colleagues could run during a client visit by literally plugging in a USB drive.

How it works

A PyInstaller one-file build bundles the Python runtime, llama_cpp_python, and the UI into a single PE.
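For context, a build along those lines can be scripted through PyInstaller's Python entry point; the entry-script name, executable name, and the --collect-all flag below are illustrative assumptions, not the project's actual build recipe:

    import PyInstaller.__main__

    # One-file, windowed build: the Python runtime, llama_cpp_python
    # (including its bundled llama.cpp shared library), and the Tk UI
    # end up inside a single PE.
    PyInstaller.__main__.run([
        "notepad.py",                   # hypothetical entry script
        "--onefile",                    # emit one self-contained .exe
        "--noconsole",                  # GUI app: no console window
        "--collect-all", "llama_cpp",   # pull in llama.cpp's DLLs and data
        "--name", "LocalLLMNotepad",    # hypothetical executable name
    ])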

On first launch, it memory-maps the .gguf; subsequent prompts stream at ~20 tok/s on an i7-10750H with gemma-3-1b-it-Q4_K_M.gguf (0.8 GB).
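Loading and streaming with llama_cpp_python look roughly like this; a minimal sketch, with the model path and parameters chosen for illustration rather than taken from the app:

    from llama_cpp import Llama

    # mmap lets the OS page weights in lazily instead of reading the
    # whole GGUF into RAM up front.
    llm = Llama(
        model_path="gemma-3-1b-it-Q4_K_M.gguf",
        n_ctx=4096,
        use_mmap=True,
    )

    # Stream tokens as they are produced so the UI can render incrementally.
    for chunk in llm.create_completion("Explain mmap in one sentence.",
                                       max_tokens=128, stream=True):
        print(chunk["choices"][0]["text"], end="", flush=True)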

Tick-driven render loop keeps the UI responsive while llama.cpp crunches.
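A tick-driven loop in Tk usually means a worker thread streams tokens into a queue while the UI drains it on a timer, so the widget never blocks. A sketch under that assumption, reusing the llm object from the previous snippet:

    import queue
    import threading
    import tkinter as tk

    prompt = "Explain mmap in one sentence."
    tokens = queue.Queue()

    def generate():
        # Worker thread: push streamed tokens into the queue instead of
        # touching Tk widgets directly (Tk is not thread-safe).
        # llm is the Llama instance from the previous sketch.
        for chunk in llm.create_completion(prompt, max_tokens=128, stream=True):
            tokens.put(chunk["choices"][0]["text"])

    def tick():
        # UI thread: drain whatever arrived since the last tick, then reschedule.
        try:
            while True:
                text.insert("end", tokens.get_nowait())
        except queue.Empty:
            pass
        root.after(33, tick)  # ~30 ticks per second keeps the window responsive

    root = tk.Tk()
    text = tk.Text(root, wrap="word")
    text.pack(fill="both", expand=True)

    threading.Thread(target=generate, daemon=True).start()
    tick()
    root.mainloop()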

A parser bold-underlines every token that originated in the prompt; Ctrl+click pops a “source viewer” to trace facts. (Helps spot hallucinations fast.)
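The highlighting can be approximated with Tk text tags. The sketch below matches at the word level rather than the model's token level, and its Ctrl+click viewer simply shows the prompt; those simplifications and the function name are assumptions, not the app's actual parser:

    import re
    import tkinter as tk
    from tkinter import font, messagebox

    def highlight_prompt_words(text_widget, prompt, reply):
        # Insert the reply, bold-underlining words that also occur in the prompt.
        prompt_words = {w.lower() for w in re.findall(r"\w+", prompt)}

        bold = font.Font(text_widget, text_widget.cget("font"))
        bold.configure(weight="bold", underline=True)
        text_widget.tag_configure("from_prompt", font=bold)
        # Ctrl+click on a highlighted word pops a simple "source viewer".
        text_widget.tag_bind(
            "from_prompt", "<Control-Button-1>",
            lambda e: messagebox.showinfo("Source", prompt))

        # Split the reply into words and separators, tagging prompt-derived words.
        for piece in re.split(r"(\w+)", reply):
            tag = ("from_prompt",) if piece.lower() in prompt_words else ()
            text_widget.insert("end", piece, tag)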
