DeepGit is a project I created to solve a frustration I've always had: finding truly relevant, high-quality GitHub repositories beyond just keyword searches. Most existing tools rely on simple keyword matching, which often misses out on context and deeper insights.
DeepGit tackles this by combining advanced techniques:
Dense Retrieval (Semantic Search): It understands your queries contextually, not just literally, to find repositories that genuinely match your intent.
Cross-Encoder Re-ranking: The retrieved candidates are then re-scored by a neural model that reads the query and the repository text together, so the final order reflects how well each repository actually fits the query.
Detailed Analysis: DeepGit reads README files and full documentation, explores repository structure, checks hardware compatibility, and evaluates repository engagement and activity.
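To give a rough feel for the retrieve-then-re-rank idea, here is a toy sketch in Python. It is not DeepGit's actual code: the repository names and descriptions are made up, `embed` is a bag-of-words stand-in for a real dense encoder, and `pair_score` is a hand-written stand-in for a neural cross-encoder. Only the two-stage shape is the point.

```python
import math
from collections import Counter

# Hypothetical repositories and descriptions, for illustration only.
DOCS = {
    "repo-a": "semantic search over code repositories with dense embeddings",
    "repo-b": "search engine for images",
    "repo-c": "library of dense embeddings and utilities for fast semantic code search across many repos",
    "repo-d": "a todo list app in react",
}
QUERY = "semantic code search"


def embed(text):
    """Toy stand-in for a dense encoder: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, docs, k):
    """Stage 1: query and documents are embedded separately, then compared."""
    q = embed(query)
    ranked = sorted(docs, key=lambda name: cosine(q, embed(docs[name])), reverse=True)
    return ranked[:k]


def pair_score(query, doc):
    """Toy stand-in for a cross-encoder: scores the (query, doc) pair jointly.

    Counts shared terms and adds a bonus when adjacent query words also
    appear adjacent, in the same order, in the document.
    """
    q = query.lower().split()
    d = doc.lower().split()
    overlap = sum(1 for t in set(q) if t in d)
    doc_bigrams = set(zip(d, d[1:]))
    phrase_bonus = sum(1 for pair in zip(q, q[1:]) if pair in doc_bigrams)
    return overlap + 2 * phrase_bonus


def search(query, docs, k=3, top_n=2):
    """Stage 2: re-rank only the k candidates that survived stage 1."""
    candidates = retrieve(query, docs, k)
    reranked = sorted(candidates, key=lambda name: pair_score(query, docs[name]), reverse=True)
    return reranked[:top_n]


print(retrieve(QUERY, DOCS, 3))  # first-stage order
print(search(QUERY, DOCS))       # order after re-ranking
```

In this toy data the first stage puts repo-a ahead of repo-c because repo-c's longer description dilutes its similarity score, and the pairwise scorer moves repo-c back to the top because it contains the query phrase in order. The design reason for two stages is cost: the cheap first stage can scan everything, while the more expensive pairwise scoring only runs on a handful of candidates.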
It's designed to surface hidden gems, useful projects you might not discover otherwise, and it provides meaningful justifications for why each repository fits your search.
DeepGit is fully open-source and built with LangChain and ColBERT embeddings. It's gotten great early traction and community feedback, reaching over 500 stars within days of being open-sourced.
You can check out the code, use it yourself, or contribute here: https://github.com/zamalali/DeepGit
There's also a live lite demo on Hugging Face if you want to see DeepGit in action right away. I've run out of my GPU quota there, so you can duplicate the Space and run it yourself: huggingface.co/spaces/zamal/DeepGit-lite
I'm always looking to make it better and to turn it into the go-to place for devs to find the best repos. Feedback and contributions are very welcome.