SingleFile
percollate
Our great sponsors
SingleFile | percollate | |
---|---|---|
94 | 14 | |
13,375 | 4,089 | |
- | - | |
9.7 | 5.9 | |
14 days ago | 2 months ago | |
JavaScript | JavaScript | |
GNU Affero General Public License v3.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SingleFile
-
My website is one binary
I agree it would be "great" a complete website in the ZIP. I think this is technically possible, someone just have to code it.
-
Omnivore – free, open source, read-it-later App
Singlefile [1] works pretty well for me for that use case.
It has the added advantage that the file format is just plain HTML, and together with “reader mode” in most browsers, it’s a great way to save long-form text or other mostly static pages for later reference.
It obviously doesn’t work for very dynamic pages, let alone web apps.
-
Pocket: It gets worse the more you use it
I’ve tried all the third party services for archiving interesting things over the years but nothing beats saving everything to your local filesystem using [SingleFile](https://github.com/gildas-lormeau/SingleFile) and using a full-text search front over the directory (something like Houdahspot, for example).
- Save webpages into Obsidian (mobile)
- Wayback: Self-hosted archiving service integrated with Internet Archive
-
Ask HN: Looking for a great tool to archive websites
For small numbers of pages, the SingleFile[0] extension for Firefox (WebExtension) is pretty handy. It's not "archival quality", though, if that's the kind of "archiving" you're doing.
- Selfhosted service to screenshot websites - but I'm not finding the options I need
-
App that has Web Clipper like Evernote
What I do now is, if I just want to save something for later reading then I save it to Pocket, and if I want to archive it i use the Single File extension to save the page as is.
-
Looking for a library to archive a webpage to store it in a database (like SingleFile)
While searching I found https://github.com/gildas-lormeau/SingleFile which would work, but I would like to stay in the Go Ecosystem. Is there a library which I didn’t find yet?
-
How to print all pages of a website?
Install this: https://github.com/gildas-lormeau/SingleFile
percollate
-
The Case Against AI Everything, Everywhere, All at Once
You can still choose automation. The easier route for me is to use wallabag to save the article. Then on my remarkable tablet I can grab a very readable document with https://github.com/koreader/koreader.
The other option is to use https://github.com/danburzo/percollate to convert a webpage to a nice document directly. I use both tools depending on my needs.
- Selfhosted service to screenshot websites - but I'm not finding the options I need
-
ArchiveBox Alternative
The Cli Tool Percollate offers a different approach, but is also very good: https://github.com/danburzo/percollate
-
Is there a command line program to convert web pages into readable markdown/htm/pdf format? preferably markdown
Concerning pdf there is the well known wkhtmltopdf , but let me say that I love the not so well known percollate
-
Show HN: Lurnby, a tool for better learning, is now open source
Since I'm working on a similar project, this is how I am planning to pull content from the web, utilizing percollate[1] to get the HTML content, I haven't written any implementation for this in Python yet.
If you don't mind me asking, how were you going to implement spaced repetition? Since the Incremental Reading algorithm has never been published as far as I know.
- What Are The Best Linux Apps?
-
Alternatives to ArchiveBox?
Maybe https://github.com/danburzo/percollate, I didnt try it and I am not sure if the html output looks like u want it.
-
Reading from the web offline and distraction-free
I do a lot of this work[3] (web to documents) and it's interesting to see other approaches. The medium image problem is something I've faced as well, but never got around to fixing. I'm planning to get a Remarkable soon, so will definitely be trying this out.
My personal solution has been https://github.com/captn3m0/url-to-epub/ (Node/readability), which I've tested against the entirety of Tor's original fiction collection[0] where it performs well enough (I'm biased). Another tool that does this beautifully well is percollate[1], but it doesn't give enough control of the metadata to the user - something I really care about.
I've also started to use rdrview[2], which is a C-port of the current Firefox implementation of "reader view". It is very unix-y, so it is easy to pipe content to it (I usually run it through tidy first). Quite helpful in building web-archiving or web-to-pdf or web-to-kindle pipelines easily.
[0]: https://www.tor.com/category/all-fiction/original-fiction/
[1]: https://github.com/danburzo/percollate
-
A little npm head-scratcher
A JavaScript project I maintain has the following file structure, abridged:
What are some alternatives?
leetcode-rating-predictor - Leetcode Rating Predictor built with Node. Browser extension and web interface.
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
page-ruler-redux - An awesome page ruler extension for google chrome
monolith - ⬛️ CLI tool for saving complete web pages as a single HTML file
sidebery - Firefox extension for managing tabs and bookmarks in sidebar.
headless-recorder - Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.
SnappySnippet - Chrome extension that allows easy extraction of CSS and HTML from selected element.
rdrview - Firefox Reader View as a command line tool
webscrapbook - A browser extension that captures web pages to local device or backend server for future retrieval, organization, annotation, and edit. This project inherits from legacy Firefox add-on ScrapBook X.
stream-detector - A Firefox addon for keeping track of manifests used by various streaming protocols and downloading media files.
koodo-reader - A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
FoxyRecon - A Firefox add-on for OSINT investigations