ricecooker
url-to-epub
| | ricecooker | url-to-epub |
|---|---|---|
| Mentions | 1 | 1 |
| Stars | 18 | 13 |
| Growth | - | - |
| Activity | 8.2 | 0.0 |
| Last commit | 8 days ago | almost 2 years ago |
| Language | Python | JavaScript |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ricecooker
Reading from the web offline and distraction-free
Very cool.
The need to take any webpage offline is also common in the education space: teachers want to save a webpage and send it to their students as part of a lesson without having to worry about availability, ads, etc.
I used to work on tools for this https://github.com/learningequality/ricecooker/blob/develop/... and https://github.com/learningequality/BasicCrawler/blob/master...
url-to-epub
Reading from the web offline and distraction-free
I do a lot of this work[3] (web to documents) and it's interesting to see other approaches. The Medium image problem is something I've faced as well, but never got around to fixing. I'm planning to get a reMarkable soon, so I will definitely be trying this out.
My personal solution has been https://github.com/captn3m0/url-to-epub/ (Node/readability), which I've tested against the entirety of Tor's original fiction collection[0] where it performs well enough (I'm biased). Another tool that does this beautifully well is percollate[1], but it doesn't give enough control of the metadata to the user - something I really care about.
I've also started to use rdrview[2], which is a C port of the current Firefox implementation of "reader view". It is very Unix-y, so it is easy to pipe content to it (I usually run it through tidy first). Quite helpful for building web-archiving, web-to-PDF, or web-to-Kindle pipelines easily.
[0]: https://www.tor.com/category/all-fiction/original-fiction/
[1]: https://github.com/danburzo/percollate
[2]: https://github.com/eafer/rdrview
[3]: https://captnemo.in/ebooks/
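For anyone wanting to try the tidy-then-rdrview pipeline described above, here is a rough Python sketch of what that chain could look like. It is not the commenter's actual setup: the helper names `reader_pipeline` and `run_pipeline` are made up for illustration, and the flags (`tidy -q`, `rdrview -H -u <base-url>`, `curl -sL`) plus the idea that rdrview reads the page from standard input are assumptions to check against the man pages of the versions you have installed.

```python
import subprocess
import sys


def reader_pipeline(url):
    """Return argv lists for a hypothetical tidy -> rdrview chain.

    Assumed flags: `tidy -q` suppresses non-essential output, `rdrview -H`
    prints the extracted article as HTML, and `-u` sets the base URL used
    to resolve relative links.
    """
    return [
        ["tidy", "-q"],
        ["rdrview", "-H", "-u", url],
    ]


def run_pipeline(commands, data):
    """Feed `data` (bytes) through each command in turn, like a shell pipe."""
    for argv in commands:
        # check=False because tidy exits non-zero when it only has warnings.
        result = subprocess.run(argv, input=data, stdout=subprocess.PIPE, check=False)
        data = result.stdout
    return data


if __name__ == "__main__" and len(sys.argv) > 1:
    page_url = sys.argv[1]
    # Fetch the raw page; curl is just one option for this step.
    raw = subprocess.run(
        ["curl", "-sL", page_url], stdout=subprocess.PIPE, check=True
    ).stdout
    sys.stdout.buffer.write(run_pipeline(reader_pipeline(page_url), raw))
```

Keeping the command list separate from the runner makes it easy to append another stage (for example an HTML-to-EPUB or HTML-to-PDF converter) without touching the piping logic.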
What are some alternatives?
zimit - Make a ZIM file from any Web site and surf offline!
percollate - A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
rdrview - Firefox Reader View as a command line tool
parser - 📜 Extract meaningful content from the chaos of a web page