There is a Prompt API in development, available in both Chrome and Edge, that gives access to a local LLM. Chrome extensions have access to it, and I believe websites can request access as part of an origin trial.
The model is fully managed by the browser. It's currently the Gemini Nano model on Chrome, and they are testing a version of the Gemma 3n model in beta channels. Edge uses phi-4-mini.
I'm sad to see it regardless - the project's been very low activity for months. Just last night I was thinking about ripping it out before launch. No observable future.
andreinwald · 2h ago
Browser LLM demo built with JavaScript and WebGPU.
WebGPU is already supported in Chrome, Safari, Firefox, iOS (v26) and Android.
- No need to use your OPENAI_API_KEY - it's a local model that runs on your device
- No network requests to any API
- No need to install any program
- No need to download files on your device (model is cached in browser)
- Site will ask before downloading large files (llm model) to browser cache
- Hosted on GitHub Pages from this repo - secure, because you can see what you are running
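For a rough idea of how this kind of in-browser setup typically works: a hypothetical sketch assuming a WebLLM-style (`@mlc-ai/web-llm`) engine. Whether this repo actually uses that library, and the model id below, are assumptions, not taken from the demo's code:

```javascript
// Hypothetical sketch of an in-browser LLM chat, assuming a WebLLM-style
// (@mlc-ai/web-llm) setup; the model id below is illustrative only.
async function chatInBrowser(question) {
  // Dynamic import so the (large) library is only fetched when needed
  const { CreateMLCEngine } = await import('@mlc-ai/web-llm');

  // First call downloads the model weights; later visits are served
  // from the browser cache instead of the network.
  const engine = await CreateMLCEngine('Llama-3.2-1B-Instruct-q4f16_1-MLC', {
    initProgressCallback: (p) => console.log(p.text), // download/compile progress
  });

  // OpenAI-style chat completion API, but everything runs on-device
  const reply = await engine.chat.completions.create({
    messages: [{ role: 'user', content: question }],
  });
  return reply.choices[0].message.content;
}
```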
cgdl · 9m ago
Which model does the demo use?
petermcneeley · 19m ago
This demo only works if your GPU exposes the WebGPU "shader-f16" feature. You can check whether you have it by looking at the feature list on https://webgpureport.org/.
The page itself could of course check for this, but since f16 support is common they probably just didn't bother.
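A page can do that check itself in a few lines; a sketch using the standard WebGPU adapter API (the feature name is "shader-f16" per the spec):

```javascript
// Sketch: detect whether the WebGPU adapter supports 16-bit floats
// ("shader-f16"); resolves to false where WebGPU is unavailable.
async function hasShaderF16() {
  const gpu = globalThis.navigator?.gpu;
  if (!gpu) return false; // no WebGPU at all
  const adapter = await gpu.requestAdapter();
  return adapter?.features.has('shader-f16') ?? false;
}

// Example: hasShaderF16().then(ok => console.log(ok ? 'f16 available' : 'f16 missing'));
```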
What's the performance of a model like this vs. the OpenAI API? What's the comparable here? Edit: I see it's the same models you'd run locally using Ollama or something else, so it's basically just constrained by the size of the model, the GPU, and the perf of the machine.
pjmlp · 49m ago
Beware of opening this on mobile Internet.
lukan · 21m ago
Well, I am on a mobile right now, can someone maybe share anything about the performance?
andreinwald · 46m ago
The demo site asks before downloading.
andsoitis · 1h ago
Very cool.
An improvement would be keeping the input text box always on screen, rather than having to scroll down manually as the screen fills.
More information is available here: https://github.com/webmachinelearning/prompt-api
Which has a full web demo: https://chat.webllm.ai/
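Per that explainer, basic usage looks roughly like the following. This is a hedged sketch of an experimental API, so names like `LanguageModel` and the availability states may change between browser versions:

```javascript
// Hypothetical sketch of the experimental Prompt API, following the
// webmachinelearning/prompt-api explainer; the surface may change.
async function askLocalModel(question) {
  if (!('LanguageModel' in globalThis)) {
    throw new Error('Prompt API not available in this browser');
  }
  // Per the explainer, availability can be 'unavailable', 'downloadable',
  // 'downloading', or 'available'.
  const availability = await LanguageModel.availability();
  if (availability === 'unavailable') {
    throw new Error('No on-device model on this machine');
  }
  // create() can trigger the model download on first use
  const session = await LanguageModel.create();
  return session.prompt(question);
}
```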
Demo (similar to ChatGPT): https://andreinwald.github.io/browser-llm/
Code: https://github.com/andreinwald/browser-llm