Show HN: RAG Firewall – retrieval-time guardrails for LangChain/LlamaIndex

1 talbuilds 1 8/29/2025, 7:48:12 PM github.com ↗

RAG pipelines are great, but they can still retrieve "toxic" chunks: – prompt injection attempts – leaked API keys/secrets – stale or conflicting content – unapproved external URLs

We built an open-source "retrieval firewall" that scans chunks before they reach the LLM: – denies injection & secrets – flags/reranks PII, encoded blobs, untrusted URLs – audit log (JSONL) of all decisions – drop-in wrappers for LangChain and LlamaIndex retrievers

Install: pip install rag-firewall Repo: https://github.com/taladari/rag-firewall

Curious if others here handle retrieval-time risks, or just ingest/output filtering. Would love feedback and red-team payloads.

Comments (1)

talbuilds · 1h ago

A couple of extra notes I didn’t fit in the main post:

– The firewall runs entirely client-side, so no data ever leaves your environment.

– It focuses on *retrieval-time* risks, not output moderation — so the LLM never sees poisoned chunks in the first place.

– Policies are YAML: you can choose to deny, allow, or just re-rank risky docs (based on recency, provenance, relevance).

– Overhead is low: scanners are regex/heuristic, so for ~5–20 chunks it adds only a few ms.

I’d love feedback on two things in particular:

1. Do you think retrieval-time filtering belongs in the pipeline, or should it all be done at ingest/output?

2. If you’ve got prompt injection payloads or edge cases you use to test your own RAG stacks, I’d love to try them against this.

Thanks for taking a look — always happy to hear critique, especially from folks running LangChain/LlamaIndex in production.

Thunder Compute (YC S24) Is Hiring (ycombinator.com)

Deepnote (YC S19) is hiring engineers to build a better Jupyter notebook (deepnote.com)

Prosper AI (YC S23) Is Hiring Founding Account Executives (NYC) (jobs.ashbyhq.com)

The Forecasting Company (YC S24) Is Hiring a Software Engineer (ycombinator.com)

Lago – Open-Source Usage Based Billing – Is Hiring in Sales, Eng, Ops (EU, US) (ycombinator.com)

Ember (YC F24) Is Hiring Full Stack Engineer (ycombinator.com)

LiteLLM (YC W23) is hiring a back end engineer (ycombinator.com)

SigNoz (YC W21, Open Source Datadog) Is Hiring Platform Engineers (Remote) (jobs.ashbyhq.com)

Motion (YC W20) Is Hiring Principal Software Engineers (jobs.ashbyhq.com)

Bild AI (YC W25) Is Hiring an Applied AI Engineer (workatastartup.com)

Text.ai (YC X25) Is Hiring Founding Full-Stack Engineer (ycombinator.com)

Cua (YC X25) is hiring design engineers in SF (ycombinator.com)

Activeloop (YC S18) Is Hiring Member of Technical Staff – Back End Engineering (careers.activeloop.ai)

Coris (YC S22) Is Hiring (ycombinator.com)

14.ai (YC W24) is hiring engineers in SF to build an AI-native Zendesk (14.ai)

Spice Data (YC S19) Is Hiring a Product Associate (New Grad) (ycombinator.com)

Ashby (YC W19) Is Hiring Design Engineers in AMER and EMEA (ashbyhq.com)

EasyPost (YC S13) Is Hiring (easypost.com)

Tesorio (YC S15) Is Hiring a Senior GenAI Engineer (100% Remote) (tesorio.com)

OneSignal (YC S11) Is Hiring Engineers (onesignal.com)

Axle (YC S22) is hiring product engineers (ycombinator.com)

Mbodi AI (YC X25) Is Hiring a Founding Research Engineer (Robotics) (ycombinator.com)

ReadMe (YC W15) Is Hiring a Developer Experience PM (readme.com)

Weave (YC W25) is hiring a founding AI engineer (ycombinator.com)

Depot (YC W23) Is Hiring a Community and Events Manager (Remote) (ycombinator.com)

CoLoop (YC S21) Is Hiring AI Engineers in London

Trellis (YC W24) Is Hiring: Automate Prior Auth in Healthcare (ycombinator.com)

Type (YC W23) is hiring a founding engineer to build an AI-native doc editor (ycombinator.com)

Foundry (YC F24) is hiring staff-level product engineers (ycombinator.com)

GoGoGrandparent (YC S16) Is Hiring Back End and Full-Stack Engineers

Kyber (YC W23) is hiring enterprise account executives (ycombinator.com)

Show HN: RAG Firewall – retrieval-time guardrails for LangChain/LlamaIndex

Comments (1)