Show HN: Compliant LLM toolkit for ensuring compliance & security of AI systems

7 kaushik92 0 6/3/2025, 2:51:51 PM github.com ↗

With the right technique, I was able to break the so-called secure models like Claude and OpenAI.

So, I built an open-source tool to automate this and find security holes in any hosted model.

I got claude-sonnet-4 to demonstrate the following harmful behavior:

- steal data from downstream tool calls using sql injection, code injection and template injection attacks

- install spyware or malware using prompt obfuscation to send data to a third-party server

Try it yourself with this simple command:

  pip install compliant-llm && compliant-llm dashboard

Show HN: GPT image editing, but for 3D models (adamcad.com)

Show HN: Ephe – A minimalist open-source Markdown paper for today (github.com)

Show HN: TinyToggle – Lightweight feature flags for devs and small teams (tinytoggle.com)

Show HN: Bloom – AI-Powered Jupyter IDE for SQL and Python Workflows (joinbloom.ai)

Show HN: Tiptap AI Agent – Add AI workflows to your text editor in minutes

Show HN: Bonvago.com – Shows hidden hotel discounts and bonus rewards

Show HN: OSS's Answer to Cursor's Codebase Level Context for Large Projects (github.com)

Show HN: Awesome-A2A – curated resources for Google's Agent2Agent protocol (github.com)

Show HN: A wah effect plugin for regular DAWs, web browsers and C64 (clack.digital)

Show HN: The never-ending story, create your own 8-bit AI adventure. (neverendingstory.ca)

Show HN: Controlling 3D models with voice and hand gestures (github.com)

Show HN: Tweety – An Integrated Terminal for Any Chromium-Based Browser (github.com)

Show HN: Hacker News historic upvote and score data (hn.dunkirk.sh)

Show HN: This Hacker News does not exist (thishackernewsdoesnotexist.com)

Show HN: I built an OSINT tools directory (r00m101.com)

Show HN: Localize React apps without rewriting code (github.com)

Show HN: NodeCosmos – Open-source products beyond software

Show HN: I wrote a Java decompiler in pure C language (github.com)

Show HN: AirAP AirPlay server – AirPlay to an iOS Device (github.com)

Show HN: Gradle plugin for faster Java compiles (github.com)

Show HN: An Alfred workflow to open GCP services and browse resources within (github.com)

Show HN: I build one absurd web project every month (absurd.website)

Show HN: Mosaique.info – Global news in context (solo dev, no ads, no tracking) (mosaique.info)

Show HN: LLMFeeder – Browser extension to extract clean content for LLM context (github.com)

Show HN: Kan.bn – An open-source alterative to Trello (github.com)

Show HN: Rust Based Per-Token Late Interaction Dense Search (github.com)

Show HN: Asciilator.com (asciilator.com)

Show HN: Thriftled – DoorDash for local thrift stores (books and clothes first) (thriftled.com)

Show HN: Onlook – Open-source, visual-first Cursor for designers (github.com)

Show HN: A toy version of Wireshark (student project) (github.com)

Show HN: A free online tool to export all PDF annotations (pdfhighlightextractor.com)

Show HN: Patio – Rent tools, learn DIY, reduce waste (patio.so)

Show HN: Moon Phase Algorithms for C, Lua, Awk, JavaScript, etc. (github.com)

Show HN: Penny-1.7B Irish Penny Journal style transfer (huggingface.co)

Show HN: Slurm-web – open-source lightweight web UI for Slurm HPC/AI clusters (slurm-web.com)

Show HN: I'm Building Ahrefs for AI Search Results (linrush.com)

Show HN: MBCompass – Android Compass App (github.com)

Show HN: I made an AI that turn live lecture into structured notes,mind-maps,PDF (notorium.app)

Show HN: All-in-one platform for AI image generation (imageninja.ai)

Show HN: I built an AI Agent that uses the iPhone (github.com)

Show HN: CrowdRender – collaborative rendering plugin for Blender (crowd-render.com)

Show HN: Ultra-lightweight chunker library with emoji support (github.com)

Show HN: A Implementation of Alpha Zero for Chess in MLX (github.com)

Show HN: Page Magic: Use AI to customize any web page (github.com)

Show HN: PunchCard Key Backup (github.com)

Show HN: Agno – A full-stack framework for building Multi-Agent Systems (github.com)

Show HN: Text to 3D simulation on a map (does history pretty well) with gmaps++ (worldlens.co)

Show HN: FLOX – C++ framework for building trading systems (github.com)

Show HN: PinSend – Share text between devices using a PIN(P2P, no login) (pinsend.app)

Show HN: .NET Threading Mystery Classes (github.com)

Show HN: Compliant LLM toolkit for ensuring compliance & security of AI systems

Comments (0)