warcprox
conifer
warcprox | conifer | |
---|---|---|
7 | 5 | |
363 | 1,457 | |
1.1% | -0.3% | |
6.4 | 0.0 | |
7 months ago | 6 months ago | |
Python | Python | |
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
warcprox
-
Offpunk 2.0
I've looked into archiving all the pages i visit as well and warcprox[1] has been bookmarked for a while now
Hard drive storage space being so cheap in the ~$15/TB range makes this more feasible even for video archival
[1] https://github.com/internetarchive/warcprox
- What is warcprox ?
-
r18 database of metadata
wget also supports WARC options if you don't need javascript etc. If you do, there's also Warcprox (https://github.com/internetarchive/warcprox), brozzler (https://github.com/internetarchive/brozzler) (which uses warcprox internally), and others.
- [HELP] I´m looking for some self-hosted solution where users can connect to the website, connect to a page, browse it and save it.
-
tofuproxy – web proxy, TLS terminator, X.509 TOFU manager, WARC/gemini browser
Wow, it's really rare these days to see a tool that supports WARC.
Despite being an ISO standard [1] and the default archive format of the internet archive, and despite a handfull of lovingly crafted tools (such as webrecorder [2], warcprox etc.), it never seems to have caught on in a broader context.
Really a shame - I' deeply convinced that the ability to archive and replay requests is a technique for defending and strengthening user rights.
Links:
[1] https://www.iso.org/standard/44717.html
[2] https://github.com/webrecorder/webrecorder-desktop
[3] https://github.com/internetarchive/warcprox
- Browser Extension for Saving Images As While Browsing
-
How to archive the tweets and replies of my own terminated twitter account(s)
However, if you prefer something open-source, you could accomplish the same thing with a tool like https://archiveweb.page/ or https://github.com/internetarchive/warcprox
conifer
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
- I have no idea how Github works. I need to download a Conifer archiving tool. Can someone explain how to do? https://github.com/Rhizome-Conifer/conifer
-
[HELP] I´m looking for some self-hosted solution where users can connect to the website, connect to a page, browse it and save it.
I saw Rhizome-conifer(https://github.com/Rhizome-Conifer/conifer). It look great but there is a lot of open issues on github, some from 2016, and i´m afraid if it will not keep up (it would be very sad).
- Looking for a tool that generates reader mode of articles
-
Ask HN: What browser extensions are a must-have for HNers in 2021?
I would personally recommend
https://github.com/rhizome-conifer/conifer
The intent is a webrecorder for the internet.
It records all js libraries, loads videos, and all else.
Once stored, you can review a snapshot at that point in time.
They have a service option, webrecorder.io, but this one let's you store directly locally.
What are some alternatives?
replayweb.page - Serverless replay of web archives directly in the browser
pywb - Core Python Web Archiving Toolkit for replay and recording of web archives
TWINT - An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
brozzler - brozzler - distributed browser-based web crawler
Reddit-Enhancement-Suite - Reddit Enhancement Suite
webrecorder-desktop - Webrecorder Desktop App!
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
auto-save-html - Firefox extension that automatically dumps HTML when browsing a specified site
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
mpiv - A fully reworked fork of Mouseover Popup Image Viewer
temporal-shift-module - [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding