dom-distiller
readability.php
Metric | dom-distiller | readability.php
---|---|---
Mentions | 3 | 3
Stars | 594 | 208
Growth | - | -
Activity | 0.0 | 4.8
Last commit | over 2 years ago | 10 months ago
Language | Java | HTML
License | GNU General Public License v3.0 or later | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
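The page does not say how the activity number is computed. The following is only a minimal sketch of one way a recency-weighted, percentile-style score like the one described above could work. The exponential decay, the 90-day half-life, and the function names `weighted_commit_score` and `activity` are all assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def weighted_commit_score(commit_dates, today, half_life_days=90):
    """Sum per-commit weights; a commit's weight halves every half_life_days.

    The decay shape and the 90-day half-life are illustrative assumptions.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity(score, all_scores):
    """Map a raw score to a 0-10 scale: the share of tracked projects
    that score strictly lower, multiplied by 10 (hypothetical mapping)."""
    below = sum(1 for s in all_scores if s < score)
    return round(10 * below / len(all_scores), 1)


if __name__ == "__main__":
    today = date(2024, 1, 1)
    # A commit made today counts fully; one made 90 days ago counts half.
    print(weighted_commit_score([date(2024, 1, 1)], today))   # 1.0
    print(weighted_commit_score([date(2023, 10, 3)], today))  # 0.5

    # Ten tracked projects with raw scores 0..9: the best one outranks 9 of 10.
    scores = list(range(10))
    print(activity(9, scores))  # 9.0
    print(activity(0, scores))  # 0.0
```

Under this sketch, a project with no recent commits drifts toward 0.0, and a score of 9.0 means the project outranks 90% of the tracked projects, which matches the "top 10%" reading given above.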
dom-distiller
- How does Firefox's Reader View work?
- The most underused browser feature
- An app like Pocket to read articles and highlight?
  The one ask you have that Literal doesn't yet support is read mode for sources (though it will automatically archive / backup sources). It looks like Chrome's read mode (i.e. the "Show simplified view" toolbar) is open source, so I think I could add support relatively quickly if you're interested.
readability.php
- Gripes with RSS after one week
  There are RSS readers like Tiny Tiny RSS [1] which are able to do exactly that (in this case using a PHP port of Mozilla's library [2]). It does not work in every case, but it is really useful.
  [1] https://tt-rss.org/
  [2] https://github.com/fivefilters/readability.php
- Nobel prize to the people behind reader mode
  https://github.com/fivefilters/readability.php is a port of it for server-side (backend) use, ported from https://github.com/mozilla/readability
- The most underused browser feature
  Any developers who'd like to contribute to improving how article content is extracted from web pages should check out Mozilla's Readability repository: https://github.com/mozilla/readability
  I'm currently trying to bring the PHP port up to speed here: https://github.com/fivefilters/readability.php
  We currently use an older version as part of our article extraction for Push to Kindle: https://www.fivefilters.org/push-to-kindle/
What are some alternatives?
readability - Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Experiment, it is now incorporated into Safari’s Reader View.
readability - A standalone version of the readability lib
ftr-site-config - Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
tidy-html5 - The granddaddy of HTML tools, with support for modern standards
parser - 📜 Extract meaningful content from the chaos of a web page
tranquility-reader-webextensions - Tranquility Reader rewritten using Webextensions API
unclutter - A modern reader mode and article library for your browser.
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
soup-strainer - A reimplementation of the Readability/Decruft algorithm using BeautifulSoup and html5lib
SponsorBlock - Skip YouTube video sponsors (browser extension)
einkbro - A small, fast web browser based on Android WebView. It's tailored for E-Ink devices but also works great on normal android devices.
tridactyl - A Vim-like interface for Firefox, inspired by Vimperator/Pentadactyl.