Collect vs awesome-datahoarding

Collect

A server to collect & archive websites that also supports video downloads (by xarantolus)

Source Code

010.one

Suggest alternative

Edit details

awesome-datahoarding

List of data-hoarding related tools (by simon987)

Suggest topics

Source Code

Suggest alternative

Edit details

Our great sponsors

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

Our great sponsors

Collect		awesome-datahoarding
	Project
1	Mentions	6
75	Stars	1,005
-	Growth	-
0.0	Activity	4.9
about 1 year ago	Latest Commit	7 months ago
TypeScript	Language
MIT License	License	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Collect

Posts with mentions or reviews of Collect. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-01-16.

Looking for open source software to scrape webpages but also make them searchable with a webui. (locally hosted)
4 projects | /r/DataHoarder | 16 Jan 2021

I created Collect a few years ago and still use it today.

awesome-datahoarding

Posts with mentions or reviews of awesome-datahoarding. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-07.

All my life was a bloody lecher. Now I have a VPN and wanna pay you all back - but how?
2 projects | /r/Piracy | 7 Dec 2023

maybe you can check at this sub or this github.
Ask HN: Looking for a great tool to archive websites
2 projects | news.ycombinator.com | 14 Apr 2023
need some guidance
1 project | /r/DataHoarder | 19 Nov 2022

Welcome! You are clearly in the right place. If I can give any advice, it would be to take a look at these two links: Awesome-DataHoarding and the wiki of this subreddit. I wish I had both of these resources when I started.
How to get started?
1 project | /r/DataHoarder | 20 Jul 2022

i have some stuff in mind, but i'm looking for tools to download it. I found a list https://github.com/simon987/awesome-datahoarding so that answers my own question, mostly. I'm just looking for some tips on how to store my data now.
Looking for open source software to scrape webpages but also make them searchable with a webui. (locally hosted)
4 projects | /r/DataHoarder | 16 Jan 2021

You might also be interested in this list, those alternatives listed are really great and better, some support the WARC format (that my program doesn't).
Trying to find a Github containing list of tool projects for backing up (discord, other places)
1 project | /r/DataHoarder | 10 Jan 2021

What are some alternatives?

When comparing Collect and awesome-datahoarding you can also consider the following projects:

grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

reventlou - Personal db information management system.

SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file

ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

n8n - Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.

snapweb - Web interface for Snapcast