Collect
ArchiveBox
Our great sponsors
Collect | ArchiveBox | |
---|---|---|
1 | 203 | |
66 | 15,362 | |
- | 1.1% | |
0.0 | 9.0 | |
about 2 months ago | 14 days ago | |
TypeScript | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Collect
-
Looking for open source software to scrape webpages but also make them searchable with a webui. (locally hosted)
I created Collect a few years ago and still use it today.
ArchiveBox
-
Selfhosted service to screenshot websites - but I'm not finding the options I need
Very clear info on their github page.
-
So...what do you use Docker for??
ArchiveBox
-
Setting up Archivebox on Truenas Scale
I recently got into self-hosting. I've wanted to create a self-hosted web archive and my friend recommended an Archivebox, unfortunately TrueCharts doesn't have a chart for it so I had to do it myself. Here's my guide on how to setup Archivebox on Truenas Scale.
- Ask HN: How do you save and browse external interesting URLs?
-
Alternative to HTTrack (website copier) as of 2023?
Archivebox is a no-go for my needs because I often want to crawl entire domains, and as far as I can tell, they don’t support that: https://github.com/ArchiveBox/ArchiveBox/issues/191
-
Did Mozilla Ever Open Source Pocket?
I also found https://floccus.org/ and https://archivebox.io/ on Alternativeto, for self-hosters.
-
Best way to back up entire website on a schedule
You could also look into something like archivebox.io, but it doesn't really mirror so great. fetchurls can make an URL list though which could in turn be fed into archivebox. Archivebox would maybe be handy if you wanted the wget download along with a PDF print + maybe sending to Wayback Machine.
- Alternative to Wallabag with better web clipper
-
Best (simple) tool for personal Wiki
https://archivebox.io/ Is what I use for that.
What are some alternatives?
paimon-moe - Your best Genshin Impact companion! Help you plan what to farm with ascension calculator and database. Also track your progress with todo and wish counter.
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
SingleFile - Web Extension and CLI tool for saving a faithful copy of an entire web page in a single HTML file
ArchivesSpace - The ArchivesSpace archives management tool
Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.
logseq - A local-first, non-linear, outliner notebook for organizing and sharing your personal knowledge base. Use it to organize your todo list, to write your journals, or to record your unique life.
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
CKAN - CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
knowledge - Everything I know
Access to Memory (AtoM) - Open-source, web application for archival description and public access.
Shiori - Simple bookmark manager built with Go
awesome-selfhosted - A list of Free Software network services and web applications which can be hosted on your own servers