sosse
marqo
sosse | marqo | |
---|---|---|
2 | 1 | |
24 | 424 | |
- | - | |
9.2 | 10.0 | |
23 days ago | over 1 year ago | |
Python | Python | |
GNU Affero General Public License v3.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sosse
-
Looking for something like ArchiveBox but with the recursive functionality of HTTrack
You could try Sosse (https://github.com/biolds/sosse), it has the crawling capabilities you're looking for and can filter on filetype.. Though it does not provide as much archiving option as ArchiveBox (Sosse can only do screenshot or HTML). Let me know if you have trouble doing the configuration, or you feel like some feature would be great adding.
-
SOSSE Search Engine Release!
I've been working since a bit of time on this free software search engine / crawler. I've done the first stable release today, please enjoy SOSSE v1.0 !
marqo
-
[D] NLP for long document understanding ?
We recently released Marqo which is our approach to tensor search! Feel free to reach out if you'd like to chat about it https://github.com/S2Search/marqo
What are some alternatives?
Studybyte - Studybyte is a search engine designed to help students find educational content effortlessly.
marqo - Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Grab - Web Scraping Framework
jina - ☁️ Build multimodal AI applications with cloud-native stack
4chan-downloader - Python3 script to continuously download all images/webms of multiple 4chan thread simultaneously - without installation
jina-financial-qa-search
TorBot - Dark Web OSINT Tool
ByteDetective - The easiest way to search for images on your desktop 🔎
SearchGar - SearchGar - An actual Search Engine made using Python
sycamore - 🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
searx - Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx]
mindflow - 🧠 AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow 🌊