Show HN: Building code agents? Try code embedding models FAST and do it better

4 islogan 0 5/1/2025, 4:43:33 AM
We are a small group of undergrads interested in building human in the loop coding agents. We dream of a world where building complex agent workflows feels as simple and creative as playing with legos.

When we were building stuff we needed a tool that made it easy to try out different code embedding models so that we could see which ones worked best in different scenarios and understand their strengths and weaknesses.

So to speed that process up we made PurpleSearch an 'instant' search engine for your local codebases. This tool lets you quickly deploy any open source embedding model on hugging face into a real application that makes their capabilities visual and easy to digest. Check out the demo vid.

The flow: Pick any Java project. We'll turn it into a code graph, embed code snippets from the graph and index them. Then you can query the code snippet index. We'll return the 10 highest ranking snippets and the the subgraph that surrounds them for deeper contextual understanding (we call it a cluster). We feed these clusters into an LLM who uses them as context to answer your query.

Purple Search!

Demo video: https://www.youtube.com/watch?v=aglrXDLoOhQ

Github: https://github.com/logantucker25/PurpleSearch

note: This tool might be a little rough around the edges and occasionally get hung up. It hasnt been fully tested for use outside of our small group. We hope that it is useful to someone out there... if so feel free add to this code in any way that you like and make it into whatever suits you best. PRs welcome.

Comments (0)

No comments yet