Collect
A server to collect & archive websites that also supports video downloads (by xarantolus)
awesome-datahoarding
List of data-hoarding related tools (by simon987)
Our great sponsors
Collect | awesome-datahoarding | |
---|---|---|
1 | 6 | |
75 | 1,005 | |
- | - | |
0.0 | 4.9 | |
about 1 year ago | 7 months ago | |
TypeScript | ||
MIT License | - |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Collect
Posts with mentions or reviews of Collect.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-01-16.
-
Looking for open source software to scrape webpages but also make them searchable with a webui. (locally hosted)
I created Collect a few years ago and still use it today.
awesome-datahoarding
Posts with mentions or reviews of awesome-datahoarding.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-12-07.
-
All my life was a bloody lecher. Now I have a VPN and wanna pay you all back - but how?
maybe you can check at this sub or this github.
- Ask HN: Looking for a great tool to archive websites
-
need some guidance
Welcome! You are clearly in the right place. If I can give any advice, it would be to take a look at these two links: Awesome-DataHoarding and the wiki of this subreddit. I wish I had both of these resources when I started.
-
How to get started?
i have some stuff in mind, but i'm looking for tools to download it. I found a list https://github.com/simon987/awesome-datahoarding so that answers my own question, mostly. I'm just looking for some tips on how to store my data now.
-
Looking for open source software to scrape webpages but also make them searchable with a webui. (locally hosted)
You might also be interested in this list, those alternatives listed are really great and better, some support the WARC format (that my program doesn't).
- Trying to find a Github containing list of tool projects for backing up (discord, other places)
What are some alternatives?
When comparing Collect and awesome-datahoarding you can also consider the following projects:
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
reventlou - Personal db information management system.
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
n8n - Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.
snapweb - Web interface for Snapcast