information-retrieval
cherche
information-retrieval | cherche | |
---|---|---|
3 | 12 | |
147 | 313 | |
- | - | |
0.0 | 4.4 | |
9 months ago | 19 days ago | |
Jupyter Notebook | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
information-retrieval
-
[D] Generative vs embedding models
You can check out my repo https://github.com/kuutsav/information-retrieval. Here I've implemented some of the common embedding techniques from sbert from scratch.
-
Bi-Encoder with BERT does not learn
I have a bunch of training scripts here that night help you figure out the bug(if any) https://github.com/kuutsav/information-retrieval
-
[P] Created tutorials on Information Retrieval, specifically Semantic Search
Hi, I've created a repo which tries to cover the current progress in the world of information-retrieval using neural information retrievers / semantic search. Repo: https://github.com/kuutsav/information retrieval .
cherche
-
[P] Semantic search
If you are interested, you can check out the documentation here: https://github.com/raphaelsty/cherche
- Minimalist semantic search with Cherche 2.0
-
[D] is it time to investigate retrieval language models?
Here is a tool I made to create retriever-reader pipeline in a minute: Cherche, would recommend also Haystack on github !
- [P] Cherche - allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.
- Cherche - allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.
- GitHub - raphaelsty/cherche: Neural search
-
[P] Library for end-to-end neural search pipelines
Github link Documentation Hackernews link
-
Hacker News top posts: Jan 10, 2022
Neural Search for medium sized corpora\ (3 comments)
-
Neural search library in Python for medium-sized corpora
https://github.com/raphaelsty/cherche
Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers. Cherche is meant to be used with small to medium sized corpora. Cherche's main strength is its ability to build diverse and end-to-end pipelines.
- Neural Search for medium sized corpora
What are some alternatives?
elastic_transformers - Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers
NetShears - iOS Network monitor/interceptor framework
ttds-cw3-research-team - A Search Engine To Find Research Papers & Datasets
primeqa - The prime repository for state-of-the-art Multilingual Question Answering research and development.
flashtext - Extract Keywords from sentence or Replace keywords in sentences.
gpl - Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
mindflow - 🧠AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow 🌊
oneline - Read a text file, one line at a time
megabots - 🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵
rank_bm25 - A Collection of BM25 Algorithms in Python
weaviate-txtai - An integration of the weaviate vector search engine with txtai
mteb - MTEB: Massive Text Embedding Benchmark