Show HN: Run AI models directly in the browser – no server or internet required (private-ai-chat.vercel.app)

This makes Perplexity look really bad. This isn't an advanced attack; this is LLM security 101. It seems like they have nobody thinking about security at all, and certainly nobody assigned to security.

Disclosure: I work on LLM security for Google.

rvz · 49m ago

Agreed.

This is really an amateur-level attack even after all this VC money and 'top engineers' not even thinking about basic LLM security for an "AI" company makes me question whether if their abilities are inflated / exaggerated or both.

Maybe Perplexity 'vibe coded' the features in their browser with no standard procedure for security compliance or testing.

Shameful.

ElectronShak · 20m ago

Maybe we need a CORS spec for llms?

veganmosfet · 7h ago

As possible mitigation, they mention "The browser should distinguish between user instructions and website content". I don't see how this can be achieved in a reliable way with LLMs tbh. You can add fancy instructions (e.g., "You MUST NOT...") and delimiters (e.g., "<non_trusted>") and fine-tune the LLM but this is not reliable, since instructions and data are processed in the same context and in the same way. There are 100s of examples out there. The only reliable countermeasures are outside the LLMs but they restrain agent autonomy.

JoshTriplett · 7h ago

The reliable countermeasure is "stop using LLMs, and build reliable software instead".

danielbln · 5h ago

https://simonwillison.net/2025/Apr/11/camel/

veganmosfet · 5h ago

Is the CaMel paper's idea implemented in some available agents?

Esophagus4 · 1h ago

> The only reliable countermeasures are outside the LLMs but they restrain agent autonomy.

Do those countermeasures mean human-in-the-loop approving actions manually like users can do with Claude Code, for example?

wat10000 · 7h ago

It’s not possible as things currently stand. It’s worrying how often people don’t understand this. AI proponents hate the “they just predict the next token” approach, but it sure helps a lot to understand what these things will actually do for a particular input.

_drewpayment · 7h ago

I think the only way I could see it happening is if you were to build an entire reversal layer with like LangExtract, tried to determine the user's intent from the question and then used that as middleware for how you let the LLM proceed based on its intent... I don't know, it seems really hard.

isodev · 7h ago

I just can’t help but wonder why was it we decided bundling random text generators with browsers was a good idea? I mean it’s a cool toy idea but shipping it to users in a critical application… someone should’ve said no.

thrown-0825 · 5h ago

our societies reward function is fundamentally flawed

paool · 8h ago

Interesting to see the evolution of "Ignore previous instructions. Do ______".

No comments yet

ruslan_sure · 18m ago

"Move fast and break things".

thekevan · 5h ago

To be fair, that was a reddit post that blatantly started with "IMPORTANT INSTRUCTIONS FOR Perplexity Comet". I get the direction they are going but the example shown was so obviously ham-handed. It clearly instructed the browser--in clear language--to get login info and post it in the the thread.

Show me something that is obfuscated and works.

pfg_ · 1h ago

The whole comment is spoilered, so you need to click on it to reveal that text. Presumably it could also appear in a comment that you need to scroll on the page to see.

It's clear to a moderator who sees the comment, but the user asking for a summary could easily have not seen it.

mcintyre1994 · 5h ago

I’m curious if it would work if it was further down the comments or buried in a tree of replies. If all you need to do is be somewhere in the Reddit comments then you don’t need to obfuscate it in many cases, a human isn’t going to see everything there.

Show HN: Clearcam – Add AI Object Detection to Your IP CCTV Cameras in a Minute (github.com)

Show HN: Bicyclopedia (bicyclopedia.lemoing.ca)

Show HN: Port Kill – A lightweight macOS status bar development port monitor (github.com)

Show HN: Publish Markdown – A tool to publish Markdown file in one click (publishmarkdown.com)

Show HN: A "Catalog of Catalogs" for Unified Metadata (github.com)

Show HN: brew-cleaner – CLI to bulk uninstall Homebrew formulae and free space (github.com)

Show HN: JavaScript-free (X)HTML Includes (github.com)

Show HN: Run AI models directly in the browser – no server or internet required (private-ai-chat.vercel.app)

Show HN: LoadGQL – a CLI for load-testing GraphQL endpoints (apps.devanswers.org)

Show HN: Creao – Vibe coding product for founders (creao.ai)

Show HN: Pinch – macOS voice translation for real-time conversations (startpinch.com)

Show HN: I was curious about spherical helix, ended up making this visualization (visualrambling.space)

Show HN: Luminal – Open-source, search-based GPU compiler (github.com)

Show HN: I built aibanner.co to stop spending hours on marketing banners (aibanner.co)

Show HN: I replaced vector databases with Git for AI memory (PoC) (github.com)

Show HN: AgentState – Lightweight state manager for multi-agent AI workflows (github.com)

Show HN: Using Common Lisp from Inside the Browser (turtleware.eu)

Show HN: OS X Mavericks Forever (mavericksforever.com)

Show HN: Python library for fetching/storing/streaming crypto market data (github.com)

Show HN: Splice – CAD for Cable Harnesses and Electrical Assemblies (splice-cad.com)

Show HN: PlutoPrint – Generate PDFs and PNGs from HTML with Python (github.com)

Show HN: ChartDB Cloud – Visualize and Share Database Diagrams (app.chartdb.io)

Show HN: FeOx – Fast embedded KV store in Rust (github.com)

Show HN: Clyp – Clipboard Manager for Linux (github.com)

Show HN: Project management system for Claude Code (github.com)

Show HN: Whispering – Open-source, local-first dictation you can trust (github.com)

Show HN: Anchor Relay – A faster, easier way to get Let's Encrypt certificates (anchor.dev)

Show HN: Nestable.dev – local whiteboard app with nestable canvases, deep links (nestable.dev)

Show HN: What country you would hit if you went straight where you're pointing (apps.apple.com)

Show HN: OpenAI/reflect – Physical AI Assistant that illuminates your life (github.com)

Show HN: I built a toy TPU that can do inference and training on the XOR problem (tinytpu.com)

Show HN: Typed-arrow – compile‑time Arrow schemas for Rust (github.com)

Show HN: Coloring Pages Generator – auto-generate from your description (cutestcoloringpages.com)

Show HN: Map of YC-Funded Startups (patrik-cihal.github.io)

Show HN: AIMless – a 10 KB single file P2P chat app with zero dependencies (github.com)

Show HN: My first game made with my homemade engine (reprobate.site)

Show HN: RepoSentinel – Track dependencies, security, and repo activity (reposentinel.com)

Show HN: Strudel Flow, a pattern sequencer built with Strudel and React Flow (github.com)

Show HN: I integrated my from-scratch TCP/IP stack into the xv6-riscv OS (github.com)

Show HN: Claudable – OpenSource Lovable that runs locally with Claude Code (github.com)

Show HN: Bizcardz.ai – Custom metal business cards (github.com)

Show HN: Fractional jobs – part-time roles for engineers (fractionaljobs.io)

Show HN: BirdNET-Go - AI avian identification & monitoring (github.com)

Show HN: HypeVortex: Combining Crypto and AI in the stupidest way possible (hypevortex.ai)

Show HN: I've made an easy to extend and flexible JavaScript logger (github.com)

Show HN: We started building an AI dev tool but it turned into a Sims-style game (youtube.com)

Show HN: SIMD-optimized Bloom filters in Mojo for large-scale systems (github.com)

Show HN: OverType – A Markdown WYSIWYG editor that's just a textarea

Show HN: Playing Piano with Prime Numbers (nabraj.com)

Show HN: Line of Code Analyzer (github.com)

Agentic Browser Security: Indirect Prompt Injection in Perplexity Comet

Comments (17)