ftr-site-config

Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications. (by fivefilters)

Ftr-site-config Alternatives

Similar projects and alternatives to ftr-site-config

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better ftr-site-config alternative or higher similarity.

Suggest an alternative to ftr-site-config

Reviews and mentions

Posts with mentions or reviews of ftr-site-config. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-08-25.
  • Best RSS experience?
    To accomplish full-text I ended up purchasing a license for https://www.fivefilters.org/full-text-rss/, self host it and bounce it through a docker container running Tor+privproxy which generates a new circuit every 10 minutes to help avoid IP based limits on certain websites I subscribe to. I can also disable the Tor bounce per-feed if needed.
  • The most underused browser feature
    news.ycombinator.com | 2021-08-25
    Thanks for mentioning Instant View, I hadn't come across that. We actually maintain something similar here: https://github.com/fivefilters/ftr-site-config

    We use these in our own tools and also get contributions from others, including Wallabag users: https://github.com/wallabag/wallabag

    Before it was sold, Instapaper used to have something similar. A public database of its site-specific extraction templates. We used that as the starting point for our repository.

  • A 4 minute introduction to RSS
    news.ycombinator.com | 2021-07-02
    If you're trying to build one yourself, have a look at the open source Readability code[1]. It was originally developed by Arc90 and is now used by Apple and Mozilla in their browser reader views. The code has been ported to a number of different languages.

    I work on a service called Full-Text RSS[2] that used a PHP port of Readability, coupled with site-specific extraction rules[3] to identify and extract article content from each feed item. It then produces a full-text version of the given feed. The idea is you subscribe to the full-text version in whichever feed reader you use and it will transparently give you full-text articles where you had partial content before.

    [1] https://github.com/mozilla/readability

    [2] https://www.fivefilters.org/full-text-rss/

    [3] https://github.com/fivefilters/ftr-site-config

  • Which iOS app is best for grabbing news and RSS feeds when you have infrequent connectivity?
    reddit.com/r/rss | 2021-05-06
    Good to hear, and for problem feeds we're happy to look into reported cases to see if we can improve extraction. On top of automatic detection, we maintain a respository of site-specific extraction rules. These get updated as we detect problems or receive reports of sites where extraction can be improved.
  • RIP Google Reader
    news.ycombinator.com | 2021-03-25
    I work on a project to transform partial feeds into full-text versions. The idea is you give it the partial feeds URL and subscribe to the feed URL it generates: https://www.fivefilters.org/full-text-rss/
  • RSS Feed with Inline Images for Funnyjunk.com?
    reddit.com/r/rss | 2021-03-13
    In this case, we've just added a site-specific extraction file: https://github.com/fivefilters/ftr-site-config/blob/master/funnyjunk.com.txt

Stats

Basic ftr-site-config repo stats
6
255
9.2
about 17 hours ago

fivefilters/ftr-site-config is an open source project licensed under GNU General Public License v3.0 or later which is an OSI approved license.

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
Find remote jobs at our new job board 99remotejobs.com.
There are 34 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.