cherche
mindflow
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cherche
-
[P] Semantic search
If you are interested, you can check out the documentation here: https://github.com/raphaelsty/cherche
- Minimalist semantic search with Cherche 2.0
-
[D] is it time to investigate retrieval language models?
Here is a tool I made to create retriever-reader pipeline in a minute: Cherche, would recommend also Haystack on github !
- [P] Cherche - allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.
- Cherche - allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.
- GitHub - raphaelsty/cherche: Neural search
-
[P] Library for end-to-end neural search pipelines
Github link Documentation Hackernews link
-
Hacker News top posts: Jan 10, 2022
Neural Search for medium sized corpora\ (3 comments)
-
Neural search library in Python for medium-sized corpora
https://github.com/raphaelsty/cherche
Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers. Cherche is meant to be used with small to medium sized corpora. Cherche's main strength is its ability to build diverse and end-to-end pipelines.
- Neural Search for medium sized corpora
mindflow
What are some alternatives?
NetShears - iOS Network monitor/interceptor framework
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
primeqa - The prime repository for state-of-the-art Multilingual Question Answering research and development.
marqo - Tensor search for humans. [Moved to: https://github.com/marqo-ai/marqo]
flashtext - Extract Keywords from sentence or Replace keywords in sentences.
google-bard-api - This project provides a FastAPI wrapper for interacting with Google Bard, a conversational AI by Google. It allows users to send messages to Google Bard and receive responses through a simple API.
gpl - Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
ChatGPT-RedditBot - The ChatGPT-RedditBot is a Reddit bot that uses the ChatGPT large language model to generate engaging responses to Reddit threads and submissions.
oneline - Read a text file, one line at a time
gensim - Topic Modelling for Humans
megabots - 🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵
haystack - :mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.