Show HN: I'm trying to make it easier to run local LLMs directly in the browser

jakobhoeg · 7/26/2025, 7:28:18 AM · github.com
If you've ever built web applications with local language models, or tried to experiment with them, you're likely familiar with the friction: writing custom hooks and UI components from scratch, and building complex integration layers that fall back to server-side models when the client can't run the model.

To simplify this, I've created two new model providers for the Vercel AI SDK that make it easy to use local and in-browser AI models through a unified API, so you get the full power of the AI SDK either way.

It currently supports:

- Chrome/Edge Built-in AI: Leverages the experimental Prompt API in Chrome (Gemini Nano) and Edge (Phi-4-mini) for native performance. It even supports multimodal inputs (images and audio), text embeddings, and structured data generation.

- WebLLM Integration: Runs popular open-source models like Llama 3 and Qwen directly in the browser (a provider-setup sketch follows this list).
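
To make this concrete, here is a minimal sketch of instantiating both providers. The import paths, factory function names, and the WebLLM model ID are illustrative assumptions for the example, not the project's confirmed API:

    // Sketch only: package names, factories, and the model ID are assumptions.
    import { builtInAI } from "@built-in-ai/core";   // Chrome/Edge Prompt API backend
    import { webLLM } from "@built-in-ai/web-llm";   // WebLLM in-browser backend

    // Gemini Nano (Chrome) or Phi-4-mini (Edge) via the experimental Prompt API
    const browserModel = builtInAI();

    // An open-source model compiled for WebLLM, downloaded and run client-side
    const localModel = webLLM("Llama-3.2-1B-Instruct-q4f16_1-MLC");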

The core idea is a seamless developer experience: you use the same streamText, generateText, streamObject, and generateObject functions and the useChat hook from the Vercel AI SDK, and can easily switch to server-side models when the client lacks support.
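
As a rough sketch of that fallback pattern (the feature-detection check and the fallback provider here are assumptions for illustration, not the project's documented behavior):

    import { streamText } from "ai";
    import { openai } from "@ai-sdk/openai";        // server-side fallback provider
    import { builtInAI } from "@built-in-ai/core";  // assumed in-browser provider

    // Use the in-browser model when the experimental Prompt API is exposed,
    // otherwise fall back to a hosted model. Recent Chrome builds surface the
    // Prompt API as a `LanguageModel` global; treat this check as an assumption
    // that may need adjusting per browser version.
    function pickModel() {
      const hasPromptAPI = "LanguageModel" in globalThis;
      return hasPromptAPI ? builtInAI() : openai("gpt-4o-mini");
    }

    const result = streamText({
      model: pickModel(),
      prompt: "Summarize this page in two sentences.",
    });

    // Identical streaming interface regardless of which backend was chosen
    for await (const chunk of result.textStream) {
      console.log(chunk);
    }

Because both backends implement the same model interface, the rest of the app (UI, hooks, streaming) stays unchanged whichever path is taken.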

This is still in its early stages, and I'd love your feedback and suggestions to help me improve it.
