arc90-readability
readability.php
arc90-readability | readability.php | |
---|---|---|
4 | 3 | |
202 | 208 | |
- | - | |
0.0 | 6.3 | |
about 2 years ago | 11 days ago | |
PHP | HTML | |
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
arc90-readability
- How do Instapaper and Pocket apps extract the content of the articles?
-
How does Firefox's Reader View work?
For those wondering if there's a redability lib in their favorite language. Here's a list of them all (as far as i know) plus the original arc-90 implementation
https://github.com/masukomi/arc90-readability/#readability
Please submit a PR if there's something i don't have listed there.
-
Show HN: Lurnby, a tool for better learning, is now open source
Huh, you are correct. I guess a better way to put this is "the original Readability I encountered was in Python"! The first version I saw was in Aaron Swartz's 2012 read2text tool, but a check of the URL I found that through says, yup, it's a Python port of Arc90's original code, which was a browser extension.
And you're right. It was in JavaScript. I finally tracked a copy down (the original is long evaporated): https://github.com/masukomi/arc90-readability/blob/master/js...
- The most underused browser feature
readability.php
-
Gripes with RSS after one week
There are RSS readers like Tiny Tiny RSS [1] which are able to do exactly that (in this case using a PHP port of Mozilla's library [2]). Does not work in 100% of cases but is a really useful thing.
[1] https://tt-rss.org/
[2] https://github.com/fivefilters/readability.php
-
Nobel prize to the people behind reader mode
https://github.com/fivefilters/readability.php is port of it, and it's server backend support, ported from https://github.com/mozilla/readability
-
The most underused browser feature
Any developers who'd like to contribute to improving how article content is extracted from web pages should check out Mozilla's Readability repository: https://github.com/mozilla/readability
I'm currently trying to bring the PHP port up to speed here: https://github.com/fivefilters/readability.php
We use currently use an older version as part of our article extraction for Push to Kindle: https://www.fivefilters.org/push-to-kindle/
What are some alternatives?
Just-Read - A customizable read mode web extension.
readability - A standalone version of the readability lib
parser - 📜 Extract meaningful content from the chaos of a web page
tidy-html5 - The granddaddy of HTML tools, with support for modern standards
awesome-reMarkable - A curated list of projects related to the reMarkable tablet
tranquility-reader-webextensions - Tranquility Reader rewritten using Webextensions API
ftr-site-config - Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
tridactyl - A Vim-like interface for Firefox, inspired by Vimperator/Pentadactyl.
SponsorBlock - Skip YouTube video sponsors (browser extension)
unclutter - A modern reader mode and article library for your browser.