sosse
Studybyte
Metric | sosse | Studybyte
---|---|---
Mentions | 2 | 1
Stars | 24 | 16
Growth | - | -
Activity | 9.2 | 0.0
Last commit | 26 days ago | over 1 year ago
Language | Python | Python
License | GNU Affero General Public License v3.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
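The description above can be read as two steps: weight each commit by how recent it is, then rank the project against all tracked projects. The sketch below is only an illustration of that reading, not the site's actual formula. The function names, the exponential decay, and the 30-day half-life are all assumptions made for the example.

```python
from bisect import bisect_left


def weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum commit weights so that recent commits count more.

    Assumption: exponential decay with a hypothetical 30-day half-life.
    A commit made today weighs 1.0, one made 30 days ago weighs 0.5.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(value, all_values):
    """Scale the percentile rank of `value` among `all_values` to 0-10.

    A score of 9.0 means 90% of the tracked projects rank strictly lower,
    i.e. the project is in the top 10%.
    """
    ranked = sorted(all_values)
    below = bisect_left(ranked, value)  # projects strictly below `value`
    return round(10 * below / len(ranked), 1)


# Hypothetical data: 100 tracked projects with raw values 0..99.
tracked = list(range(100))
print(activity_score(90, tracked))  # 9.0
```

Under these assumptions, a project with no recent commits gets a raw value near zero and therefore a score near 0.0, which matches the kind of contrast shown in the table.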
sosse
- Looking for something like ArchiveBox but with the recursive functionality of HTTrack
  You could try Sosse (https://github.com/biolds/sosse). It has the crawling capabilities you're looking for and can filter on file type, though it does not provide as many archiving options as ArchiveBox (Sosse can only save a screenshot or HTML). Let me know if you have trouble with the configuration, or if you feel some feature would be worth adding.
- SOSSE Search Engine Release!
  I've been working on this free software search engine / crawler for a while. I published the first stable release today, so please enjoy SOSSE v1.0!
Studybyte
What are some alternatives?
Grab - Web Scraping Framework
resin - Vector space index based search engine that's available as a HTTP service or as an embedded library.
4chan-downloader - Python3 script to continuously download all images/webms from multiple 4chan threads simultaneously, without installation
rats-search - BitTorrent P2P multi-platform search engine for Desktop and Web servers with integrated torrent client.
TorBot - Dark Web OSINT Tool
eduhub-website - A community website. If you want to contribute to this project, go through the README.md and click the link. Raise genuine PRs only; they will be accepted, so please be patient. Star this repo. You aren't allowed to update README.md. Developers, content writers, and programming enthusiasts are welcome.
SearchGar - An actual Search Engine made using Python
achoz - Search through all your personal data efficiently like web search.
searx - Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx]
django-admin-site-search - A search (cmd+k) modal, for the Django admin UI, that searches your entire site.
marqo - Tensor search for humans. [Moved to: https://github.com/marqo-ai/marqo]
searchAPI - A simple API to get search engine results.