| | mwmbl | bertsearch |
|---|---|---|
| Mentions | 27 | - |
| Stars | 1,362 | 886 |
| Growth | 1.0% | - |
| Activity | 9.4 | 0.0 |
| Last commit | 9 days ago | about 1 year ago |
| Language | Python | Python |
| License | GNU Affero General Public License v3.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
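The definitions above can be illustrated with a rough sketch. The exact formulas behind these numbers are not given here, so everything in this block is an assumption made for illustration: the exponential half-life weighting, the 30-day half-life, and the function names `growth_percent`, `weighted_activity`, and `relative_score` are all hypothetical.

```python
from datetime import date


def growth_percent(previous_stars, current_stars):
    """Month-over-month growth in stars, as a percentage of last month's count."""
    return 100.0 * (current_stars - previous_stars) / previous_stars


def weighted_activity(commit_dates, today, half_life_days=30.0):
    """Raw activity: each commit counts less the older it is.

    Assumption: a commit's weight halves every `half_life_days` days.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def relative_score(raw, all_raw):
    """Map a raw value onto a 0-10 scale relative to all tracked projects.

    The score is 10 times the fraction of projects with a strictly lower
    raw value, so 9.0 means the project is ahead of 90% of the others,
    i.e. in the top 10%.
    """
    below = sum(1 for r in all_raw if r < raw)
    return round(10.0 * below / len(all_raw), 1)
```

Under these assumptions, a project whose raw activity is higher than that of nine out of ten tracked projects would get a relative score of 9.0, which matches the "top 10%" reading given above.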
mwmbl
- FLaNK Stack Weekly 19 Feb 2024
- Text Processing Practice Expt: 27 SERP Types to SQLite (Yy084)
  echo "https://mwmbl.org/?q=$x"|client 185.34.32.175
- How bad are search results? Compare Google, Bing, Marginalia, Kagi, and ChatGPT
  Ironically I had to use a search engine to discover what "Mwmbl" was. It's apparently a search engine. But, visiting the front page, I see something akin to a git commit log?! I'm not sure I'd have guessed that this was a SE if Brave Search did not tell me it was (even then I'm not convinced yet).
  https://mwmbl.org/
- Indexing a Billion Pages
  I believe this is closer to the thing you were asking about, and the simple answer appears to be "a home grown one in python" https://github.com/mwmbl/mwmbl/blob/e544d45c374c13cdc1a5048d...
- Welcome to mwmbl, the free, open-source and non-profit search engine
- Marginalia.nu API
- Show HN: Ichido, search engine that tags sites using Google and Cloudflare
- Introduction!
- Mwmbl, the free, open-source and non-profit search engine
bertsearch
We haven't tracked posts mentioning bertsearch yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
Lobsters - Computing-focused community centered around link aggregation and discussion
jina-financial-qa-search
whoogle-search - A self-hosted, ad-free, privacy-respecting metasearch engine
happy-transformer - Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
PiTheremin
bertviz - BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
code-search-blocklist - A list of domains hosting scraped code snippets and polluting search results to block.
haystack - 🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
ublock-origin-shitty-copies-filter - Filter for DuckDuckGo and Google to remove those spam-websites that just blatantly copy and paste content from well known websites.
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
ublacklist - Blocks specific sites from appearing in Google search results
clip-as-service - 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP