dom-distiller
ftr-site-config
| | dom-distiller | ftr-site-config |
|---|---|---|
| Mentions | 3 | 13 |
| Stars | 594 | 349 |
| Growth | - | - |
| Activity | 0.0 | 9.5 |
| Last commit | over 2 years ago | 6 days ago |
| Language | Java | |
| License | GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dom-distiller
- How does Firefox's Reader View work?
- The most underused browser feature
- An app like Pocket to read articles and highlight?
The one ask you have that Literal doesn't yet support is read mode for sources (though it will automatically archive / backup sources). It looks like Chrome's read mode (i.e. the "Show simplified view" toolbar) is open source, so I think I could add support relatively quickly if you're interested.
ftr-site-config
- can someone suggest a good rss reader for android please?
As far as full-text caching... maybe a self-hosted instance or paid version of the FiveFilters Full-Text RSS service would work. You can integrate that into whatever aggregator you want.
- Help Finding the Best RSS App Mac/iOS
However you can retrofit this onto any reader by using a service that creates a full text feed from a summary feed. Two that I have used in the past are https://morss.it/ and https://www.fivefilters.org/full-text-rss/.
- How to rebuild social media on top of RSS
RSS feeds that don't contain the full article text drive me nuts.
Here is a workaround that I've had good luck with:
https://www.fivefilters.org/full-text-rss/
In addition to improving usability, it defeats attempts to measure clickbait summary efficacy, etc., since it breaks sites' ability to pull popularity / telemetry info.
- RSS-Bridge: feeds for websites that don't have one
By any chance, could this be used as an alternative to the full-article RSS tool that FiveFilters offers?
- NetNewsWire: Free and Open Source RSS Reader for Mac and iOS
Please check out FullTextRSS from Five Filters: https://www.fivefilters.org/full-text-rss/
They have an OSS version you can host yourself. It fixes the problem of sites not sharing their full text in their feed, by going and scraping the site into a full feed for you.
- Newsbite and seeing full articles
Full-Text RSS - FiveFilters.org
- Show HN: Newser, utility written in go to generate a pdf with news content
This is great!
If it's useful, I work on a project where we maintain a repository of XPath selectors for extracting article content from many different sites: https://github.com/fivefilters/ftr-site-config - they're based on the original public Instapaper rules.
We also have PDF generation, but it's not really for crawling, and wasn't created for reading on a device like the Supernote, more for printing and reading: https://pdf.fivefilters.org/simple-print/
- Best RSS experience?
To get full text I ended up purchasing a license for https://www.fivefilters.org/full-text-rss/, self-hosting it, and bouncing it through a Docker container running Tor+privproxy, which generates a new circuit every 10 minutes to help avoid IP-based limits on certain websites I subscribe to. I can also disable the Tor bounce per feed if needed.
- The most underused browser feature
Thanks for mentioning Instant View, I hadn't come across that. We actually maintain something similar here: https://github.com/fivefilters/ftr-site-config
We use these in our own tools and also get contributions from others, including Wallabag users: https://github.com/wallabag/wallabag
Before it was sold, Instapaper used to have something similar: a public database of its site-specific extraction templates. We used that as the starting point for our repository.
- A 4 minute introduction to RSS
If you're trying to build one yourself, have a look at the open source Readability code[1]. It was originally developed by Arc90 and is now used by Apple and Mozilla in their browser reader views. The code has been ported to a number of different languages.
I work on a service called Full-Text RSS[2] that used a PHP port of Readability, coupled with site-specific extraction rules[3] to identify and extract article content from each feed item. It then produces a full-text version of the given feed. The idea is you subscribe to the full-text version in whichever feed reader you use and it will transparently give you full-text articles where you had partial content before.
[1] https://github.com/mozilla/readability
[2] https://www.fivefilters.org/full-text-rss/
[3] https://github.com/fivefilters/ftr-site-config
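The approach described in the last comment (try site-specific extraction rules first, fall back to a generic extractor, then emit a full-text version of each feed item) can be sketched roughly as follows. This is a minimal illustration, not the Full-Text RSS implementation: the `directive: XPath` rule layout, the `example.com` page, and the function names `parse_rules`, `extract`, and `full_text_items` are all assumptions made for the example. It uses only the Python standard library, whose `xml.etree.ElementTree` supports a limited XPath subset and needs well-formed markup, and the fallback is a crude stand-in for a Readability-style heuristic.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Hypothetical rule text: one "directive: XPath" pair per line.
RULES_TEXT = """
# example.com (made-up site)
title: .//h1
body: .//div[@class='article-body']
"""

# Stand-in for fetched pages so the sketch runs without network access.
PAGES = {
    "https://example.com/a": (
        "<html><body><h1>Title</h1>"
        "<div class='nav'><p>Menu</p></div>"
        "<div class='article-body'><p>First paragraph.</p>"
        "<p>Second paragraph.</p></div></body></html>"
    )
}


def parse_rules(text):
    """Collect 'key: value' lines into a dict of lists, skipping blanks and comments."""
    rules = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or ":" not in line:
            continue
        key, value = line.split(":", 1)
        rules.setdefault(key.strip(), []).append(value.strip())
    return rules


def text_of(element):
    """Join all text inside an element and normalise whitespace."""
    return " ".join(" ".join(element.itertext()).split())


def extract(page, rules):
    """Return article text using the first matching 'body' rule, else a crude fallback."""
    root = ET.fromstring(page)
    for xpath in rules.get("body", []):
        node = root.find(xpath)
        if node is not None:
            return text_of(node)
    # Fallback: concatenate every <p>. A real generic extractor scores candidates instead.
    return " ".join(text_of(p) for p in root.iter("p"))


def full_text_items(items, rules_by_host, fetch):
    """Return copies of feed items with a 'content' field holding the extracted text."""
    result = []
    for item in items:
        host = urlparse(item["link"]).hostname
        rules = rules_by_host.get(host, {})
        result.append({**item, "content": extract(fetch(item["link"]), rules)})
    return result


if __name__ == "__main__":
    rules = {"example.com": parse_rules(RULES_TEXT)}
    feed = [{"link": "https://example.com/a", "summary": "Partial text..."}]
    for entry in full_text_items(feed, rules, PAGES.__getitem__):
        print(entry["content"])
```

With the site rule in place, only the text inside the `article-body` element is kept; without it, the fallback also picks up the navigation paragraph, which is the kind of noise site-specific rules are meant to avoid.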
What are some alternatives?
readability - Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Experiment, it is now incorporated into Safari’s Reader View.
tridactyl - A Vim-like interface for Firefox, inspired by Vimperator/Pentadactyl.
parser - 📜 Extract meaningful content from the chaos of a web page
arc90-readability - A copy of the original Arc90 repo with links to many of the current ports.
unclutter - A modern reader mode and article library for your browser.
rssguard - Feed reader (and podcast player) which supports RSS/ATOM/JSON and many web-based feed services.
soup-strainer - A reimplementation of the Readability/Decruft algorithm using BeautifulSoup and html5lib
SponsorBlock - Skip YouTube video sponsors (browser extension)
einkbro - A small, fast web browser based on Android WebView. It's tailored for E-Ink devices but also works great on normal android devices.
readability - A standalone version of the readability lib
ALL-about-RSS - A list of RSS related stuff: tools, services, communities and tutorials, etc.