code-indexer-loop
relevanceai
code-indexer-loop | relevanceai | |
---|---|---|
3 | 1 | |
161 | 101 | |
5.6% | - | |
6.3 | 6.4 | |
about 1 month ago | 3 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
code-indexer-loop
- Python library for indexing and retrieving source code files through an integrated vector database (not mine)
-
Show HN: Code Indexer Loop
Sweep is mentioned as attribution in multiple place a) https://github.com/definitive-io/code-indexer-loop#attributi... b) https://github.com/definitive-io/code-indexer-loop/blob/fd9d...
The difference is packaging it as a consumable PyPI package that can easily be used in a project (they even call out for separating this out into a stand alone project but that they lack the time to do so): https://docs.sweep.dev/blogs/chunking-2m-files#future-
In addition, we expand and fix the implementation, for example it now supports limiting on token count instead of character count, and we fix some white space inconsistencies in parsing/chunk reconstruction.
relevanceai
-
Launch a beautiful projector in under 10 lines of code!
π You can view our github repository here!
What are some alternatives?
bor - User-friendly, tiny source code searcher written by pure Python.
dedupe - :id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
retake - PostgreSQL for Search [Moved to: https://github.com/paradedb/paradedb]
vector-search - The definitive guide to using Vector Search to solve your semantic search production workload needs.
flit - Simplified packaging of Python modules
umap-sharp - C# library for fast embeddings projection using Uniform Manifold Approximation and Projection
Resume-Matcher - Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
CodeSearchNet - Datasets, tools, and benchmarks for representation learning of code.
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
embedditor - β‘ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
ChatData - ChatData π π brings RAG to real applications with FREEβ¨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 million arxiv papers.