Studybyte
sosse
Our great sponsors
Studybyte | sosse | |
---|---|---|
1 | 2 | |
16 | 23 | |
- | - | |
0.0 | 9.3 | |
over 1 year ago | 7 days ago | |
Python | Python | |
MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Studybyte
sosse
-
Looking for something like ArchiveBox but with the recursive functionality of HTTrack
You could try Sosse (https://github.com/biolds/sosse), it has the crawling capabilities you're looking for and can filter on filetype.. Though it does not provide as much archiving option as ArchiveBox (Sosse can only do screenshot or HTML). Let me know if you have trouble doing the configuration, or you feel like some feature would be great adding.
-
SOSSE Search Engine Release!
I've been working since a bit of time on this free software search engine / crawler. I've done the first stable release today, please enjoy SOSSE v1.0 !
What are some alternatives?
resin - Vector space search engine. Available as a HTTP service or as an embedded library.
Grab - Web Scraping Framework
rats-search - BitTorrent P2P multi-platform search engine for Desktop and Web servers with integrated torrent client.
4chan-downloader - Python3 script to continuously download all images/webms of multiple 4chan thread simultaneously - without installation
eduhub-website - Hey it's a community website so if you want to contrinute in this project then go through the README.md section and also click the link. Raise Genuine PRs only. Your PRs will be accepted, keep patience. Star This Repo. You aren't allowed to Update README.md. Welcoming developers, content writers, and programming enthusiasts.
TorBot - Dark Web OSINT Tool
achoz - Search through all your personal data efficiently like web search.
SearchGar - SearchGar - An actual Search Engine made using Python
django-admin-site-search - A search (cmd+k) modal, for the Django admin UI, that searches your entire site.
searx - Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx]
searchAPI - A simple API to get the search engines results.
marqo - Tensor search for humans. [Moved to: https://github.com/marqo-ai/marqo]