Show HN: I spent 8 months trying to make LLMs Hack

2 Adeeb0123 2 5/24/2025, 5:11:49 PM
Hey HN!

For that last 8 months I've been trying to make agents that can hack web applications to find vulnerabilities in them - An AI Security Tester.

The system has 29 agents in total, a custom LLM Orchestration framework which works on the task-subtask architecture (old-school but works amazingly for my use case, and is pretty reliable) with custom agent calling mechanism.

No Auo-Gen, Langchain and Crew AI - Everything custom built for pentesting.

Each test runs in an isolated Kali linux environment (on AWS Fargate), where the agents have full access to the environment to undertake any step to hack the web application and find vulnerabilities. The agents have full access to the internet (through tavily) to search up and research content while conducting the test.

After the test has been completed, which can take anywhere from 2-12 hours depending on the target, Peneterrer gives a full Vulnerability Management portal + A Pentest report completely generated by AI (sometimes 30+ pages long)

You can test it out here - https://peneterrer.com/

Feedback appreciated!

Comments (2)

NitpickLawyer · 2h ago
How many real-world issues have you found, submitted and acknowledged/rewarded for?
Adeeb0123 · 1h ago
So did this test for one guy (just after launch, cold DM), and found out that his react code could be traversed under /assets - they had somehow put code files in their /public directory while building their app.

Right now, its's only been a month since launch and currently 123+ businesses have used this