Subscribe and receive the latest tech updates, startup insights, and industry trends delivered straight to your inbox.
We respect your privacy. Unsubscribe at any time.
When we were building stuff we needed a tool that made it easy to try out different code embedding models so that we could see which ones worked best in different scenarios and understand their strengths and weaknesses.
So to speed that process up we made PurpleSearch an 'instant' search engine for your local codebases. This tool lets you quickly deploy any open source embedding model on hugging face into a real application that makes their capabilities visual and easy to digest. Check out the demo vid.
The flow: Pick any Java project. We'll turn it into a code graph, embed code snippets from the graph and index them. Then you can query the code snippet index. We'll return the 10 highest ranking snippets and the the subgraph that surrounds them for deeper contextual understanding (we call it a cluster). We feed these clusters into an LLM who uses them as context to answer your query.
Purple Search!
Demo video: https://www.youtube.com/watch?v=aglrXDLoOhQ
Github: https://github.com/logantucker25/PurpleSearch
note: This tool might be a little rough around the edges and occasionally get hung up. It hasnt been fully tested for use outside of our small group. We hope that it is useful to someone out there... if so feel free add to this code in any way that you like and make it into whatever suits you best. PRs welcome.