Currently it is hard-limited to these file extensions: https://github.com/kantord/SeaGOAT/blob/ebfde263b970ddecdddf...
This is to avoid wasting time processing files that cannot lead to good results. If you want to try it with a different programming language, please fork the repo, add your file formats, and test whether it gives meaningful results. If it does, please submit a pull request.
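As a rough illustration of what a hard extension limit like this can look like, here is a minimal Python sketch. It is not SeaGOAT's actual code: the names `SUPPORTED_EXTENSIONS`, `is_supported`, and `filter_supported` are hypothetical, and the extensions listed are placeholders. The real list is in the file linked above.

```python
from pathlib import Path

# Hypothetical allowlist, for illustration only.
# The real list lives in the SeaGOAT source file linked above.
SUPPORTED_EXTENSIONS = frozenset({".py", ".js", ".ts"})


def is_supported(path, extensions=SUPPORTED_EXTENSIONS):
    """Return True if the file's extension is on the allowlist (case-insensitive)."""
    return Path(path).suffix.lower() in extensions


def filter_supported(paths, extensions=SUPPORTED_EXTENSIONS):
    """Keep only the paths worth processing, preserving their order."""
    return [p for p in paths if is_supported(p, extensions)]
```

Under these assumptions, trying another language would amount to extending the set, for example `is_supported("main.rs", SUPPORTED_EXTENSIONS | {".rs"})`, and then checking whether the search results are meaningful.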
Other than that, one limitation is that it uses a model under the hood that was trained on a specific dataset, filtered for a specific list of programming languages. So without changing the model as well, support for other languages could be subpar. At the moment the model is all-MiniLM-L6-v2. Here's a detailed summary of the dataset: https://huggingface.co/sentence-transformers/all-MiniLM-L6-v...
By the way, I am also working on a web version that will let you search multiple repositories at the same time. You will be able to self-host it at work or run it locally on your machine: https://github.com/kantord/SeaGOAT-web
That could provide a nicer interactive experience for more complex queries.
Semantra! Shared it yesterday on HN https://github.com/freedmand/semantra
I've been test driving a similar one https://github.com/sturdy-dev/semantic-code-search
But yours has a more permissive license!
I also had to modify it a bit to allow for the line endings I needed. Frustratingly, it doesn't allow specifying a path, and it often returns tests instead of code.
UniteAI brings together speech recognition and document / code search. The major difference is your UI is your preferred text editor.
https://github.com/freckletonj/uniteai