datasketch
searx
datasketch | searx | |
---|---|---|
1 | 3 | |
2,352 | 8,282 | |
- | - | |
6.4 | 9.5 | |
about 1 month ago | about 3 years ago | |
Python | Python | |
MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
datasketch
-
D Efficient Way To Cluster Millions Of Face
A great library that implements all those "big data" algos is https://github.com/ekzhu/datasketch
searx
- Altsear.ch – yes you can get by without using Google/Yahoo/Bing to search
- Searx 1.0.0: first stable release after 7 years
-
Search engine recommendation: Whoogle
What's the difference between this and Searx?
What are some alternatives?
image-ndd-lsh - Near-duplicate image detection using Locality Sensitive Hashing
searxng - SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
LSH - Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
search-plugins - Search plugins for the search feature
dedup - Find duplicate text files.
Redirector - Browser extension (Firefox, Chrome, Opera, Edge) to redirect urls based on regex patterns, like a client side mod_rewrite.
Neural-Scam-Artist - Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
whoogle-search - A self-hosted, ad-free, privacy-respecting metasearch engine
jina - ☁️ Build multimodal AI applications with cloud-native stack
qutebrowser - A keyboard-driven, vim-like browser based on Python and Qt.
solrpy - Automatically exported from code.google.com/p/solrpy