sosse
TorBot
sosse | TorBot | |
---|---|---|
2 | 1 | |
24 | 2,649 | |
- | 4.1% | |
9.2 | 8.5 | |
23 days ago | about 1 month ago | |
Python | Python | |
GNU Affero General Public License v3.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sosse
-
Looking for something like ArchiveBox but with the recursive functionality of HTTrack
You could try Sosse (https://github.com/biolds/sosse), it has the crawling capabilities you're looking for and can filter on filetype.. Though it does not provide as much archiving option as ArchiveBox (Sosse can only do screenshot or HTML). Let me know if you have trouble doing the configuration, or you feel like some feature would be great adding.
-
SOSSE Search Engine Release!
I've been working since a bit of time on this free software search engine / crawler. I've done the first stable release today, please enjoy SOSSE v1.0 !
TorBot
What are some alternatives?
Studybyte - Studybyte is a search engine designed to help students find educational content effortlessly.
freshonions-torscraper - Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Grab - Web Scraping Framework
OpenWPM - A web privacy measurement framework
4chan-downloader - Python3 script to continuously download all images/webms of multiple 4chan thread simultaneously - without installation
maskphish - Introducing "URL Making Technology" to the world for the very FIRST TIME. Give a Mask to Phishing URL like a PRO.. A MUST have tool for Phishing.
SearchGar - SearchGar - An actual Search Engine made using Python
gentor - GenTor - Make your internet traffic anonymized through Tor network.
searx - Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx]
GHunt - 🕵️♂️ Offensive Google framework.
marqo - Tensor search for humans. [Moved to: https://github.com/marqo-ai/marqo]
phoneinfoga - Information gathering framework for phone numbers