trailcap
obelisk
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
trailcap
-
SingleFile: Save a Complete Web Page into a Single HTML File
Thanks for this project. I found SingleFile a year or two ago, and used it to take "HTML Screenshots" of third party sites I could embed in guided walkthroughs with modified/example data changed, instead of just PNGs.
SingleFile was ultra-valuable for this.
If anyone has a similar use-case, I wrote some pretty rough (and slow) code to post-process SingleFile's output to remove any HTML that wasn't contributing to the presentational render by launching puppeteer and comparing pixels. It's available here: https://github.com/mieko/trailcap
obelisk
-
Looking for a library to archive a webpage to store it in a database (like SingleFile)
I've made Obelisk several years ago for archival purpose. It's inspired by monolith with several improvements.
-
SingleFile alternatives - obelisk and wayback
3 projects | 2 Mar 2022
Go package and CLI tool for saving web page as single HTML file
- SingleFile: Save a Complete Web Page into a Single HTML File
What are some alternatives?
SingleFile-MV3 - SingleFile version compatible with Manifest V3. The future, right now!
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
awesome-web-archiving - An Awesome List for getting started with web archiving
cairn - NPM package and CLI tool for saving web page as single HTML file
firefox-scrapbook - ScrapBook X – a legacy Firefox add-on that captures web pages to local device for future retrieval, organization, annotation, and edit.
webextensions - Charter and administrivia for the WebExtensions Community Group (WECG)
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
wayback - A bot for Telegram, Mastodon, Slack, and other messaging platforms archives webpages.