Ask HN: How do you defend customer support AI agents from prompt injection?

1 theHolyTrynity 0 6/19/2025, 10:35:07 AM

We have built a customer support agent that does the following: - retrieve data around company services from a RAG - is connected to a few tools to escalate to humans and write support tickets - has voice (11labs)

we did complete a POC but now we would like to audit this system to make sure it is safe against major attacks

we did try to follow the design patterns of deepmind, but wondering if there is any tool (preferably free / open source) that can red team our bot

Andrej Karpathy: Software in the era of AI [video] (youtube.com)

The Zed Debugger Is Here (zed.dev)

Show HN: Claude Code Usage Monitor – real-time tracker to dodge usage cut-offs (github.com)

From LLM to AI Agent: What's the Real Journey Behind AI System Development? (codelink.io)

TI to invest $60B to manufacture foundational semiconductors in the U.S. (ti.com)

Show HN: Unregistry – “docker push” directly to servers without a registry (github.com)

Elliptic Curves as Art (elliptic-curves.art)

My iPhone 8 Refuses to Die: Now It's a Solar-Powered Vision OCR Server (terminalbytes.com)

Show HN: Workout.cool – Open-source fitness coaching platform (github.com)

Six-month-old, solo-owned vibe coder Base44 sells to Wix for $80M cash (techcrunch.com)

The Missing 11th of the Month (drhagen.com)

Getting Started Strudel (strudel.cc)

MCP Specification – version 2025-06-18 changes (modelcontextprotocol.io)

Bento: A Steam Deck in a Keyboard (github.com)

Show HN: VS Code extension to share code snippets instantly (snippetshare.dev)

SpaceX Starship 36 Anomaly (twitter.com)

3D printable 6" f/5 compact travel telescope model (printables.com)

The unreasonable effectiveness of fuzzing for porting programs (rjp.io)

Websites are tracking you via browser fingerprinting (engineering.tamu.edu)

The Matrix (1999) Filming Locations – Shot-for-Shot – Sydney, Australia [video] (youtube.com)

Visual History of the Latin Alphabet (uclab.fh-potsdam.de)

Homomorphically Encrypting CRDTs (jakelazaroff.com)

PWM flicker: Invisible light that's harming our health? (caseorganic.medium.com)

Poline – An enigmatic color palette generator using polar coordinates (meodai.github.io)

Fang, the CLI Starter Kit (github.com)

Writing documentation for AI: best practices (docs.kapa.ai)

Game Hacking – Valve Anti-Cheat (VAC) (codeneverdies.github.io)

Law as Rhetoric, Rhetoric as Law: The Arts of Cultural and Communal Life (1985) [pdf] (lwionline.org)

Citizen science illuminates the nature of city lights (nature.com)

New US visa rules will force foreign students to unlock social media profiles (theguardian.com)

Revisiting Minsky's Society of Mind in 2025 (suthakamal.substack.com)

More Front End Web Tricks (kaiwenwang.com)

Attimet (YC F24) – Quant Trading Research Lab – Is Hiring Founding Engineer (ycombinator.com)

Show HN: I built a tensor library from scratch in C++/CUDA (github.com)

Liberux Nexx: An interview with Liberux about their made-in-EU OSHW Linux Phone (linmob.net)

A deep-dive explainer on Ink and Switch's BeeKEM protocol (meri.garden)

Yes I Will Read Ulysses Yes (theatlantic.com)

Dr. Demento Announces Retirement After 55-Year Radio Career (sopghreporter.com)

USDA Pomological Watercolors (search.nal.usda.gov)

Framework Laptop 12 review (arstechnica.com)

Honda conducts successful launch and landing of experimental reusable rocket (global.honda)

Show HN: Gifty – A real-world gift hunt you play with your feet (gifty-en.vercel.app)

Real-time action chunking with large models (pi.website)

Introduction to the A* Algorithm (2014) (redblobgames.com)

I feel open source has turned into two worlds (utcc.utoronto.ca)

After millions of years, why are carnivorous plants still so small? (smithsonianmag.com)

Reasoning by Superposition: A Perspective on Chain of Continuous Thought (arxiv.org)

MiniMax-M1 open-weight, large-scale hybrid-attention reasoning model (github.com)

Polyhedra Viewer (polyhedra.tessera.li)

Scrappy – Make little apps for you and your friends (pontus.granstrom.me)

Ask HN: How do you defend customer support AI agents from prompt injection?

Comments (0)