ftr-site-config VS hnrss

Compare ftr-site-config and hnrss and see how they differ.

ftr-site-config

Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications. (by fivefilters)

hnrss

Custom, realtime RSS feeds for Hacker News (by hnrss)

              ftr-site-config                            hnrss
Mentions      13                                         68
Stars         349                                        488
Growth        -                                          0.4%
Activity      9.5                                        3.5
Last commit   7 days ago                                 2 months ago
Language      -                                          Go
License       GNU General Public License v3.0 or later   -

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

ftr-site-config

Posts with mentions or reviews of ftr-site-config. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-12.
  • can someone suggest a good rss reader for android please?
    2 projects | /r/rss | 12 Jul 2023
    As far as full-text caching... maybe a self-hosted instance or paid version of the FiveFilters Full-Text RSS service would work. You can integrate that into whatever aggregator you want.
  • Help Finding the Best RSS App Mac/iOS
    2 projects | /r/rss | 4 Mar 2023
    However you can retrofit this onto any reader by using a service that creates a full text feed from a summary feed. Two that I have used in the past are https://morss.it/ and https://www.fivefilters.org/full-text-rss/.
  • How to rebuild social media on top of RSS
    3 projects | news.ycombinator.com | 13 Dec 2022
    RSS feeds that don't contain the full article text drive me nuts.

    Here is a workaround that I've had good luck with:

    https://www.fivefilters.org/full-text-rss/

    In addition to improving usability, it defeats attempts to measure clickbait summary efficacy, etc., since it breaks sites' ability to pull popularity / telemetry info.

  • RSS-Bridge: feeds for websites that don't have one
    3 projects | /r/rss | 17 Nov 2022
    By any chance, could this be used as an alternative to the full-article RSS tool that FiveFilters offers?
  • NetNewsWire: Free and Open Source RSS Reader for Mac and iOS
    5 projects | news.ycombinator.com | 1 Oct 2022
    Please check out FullTextRSS from Five Filters: https://www.fivefilters.org/full-text-rss/

    They have an OSS version you can host yourself. It fixes the problem of sites not sharing their full text in their feed, by going and scraping the site into a full feed for you.

  • Newsbite and seeing full articles
    2 projects | /r/rss | 11 Jul 2022
    Full-Text RSS - FiveFilters.org
  • Show HN: Newser, utility written in go to generate a pdf with news content
    3 projects | news.ycombinator.com | 20 Feb 2022
    This is great!

    If it's useful, I work on a project where we maintain a repository of XPath selectors for extracting article content from many different sites: https://github.com/fivefilters/ftr-site-config - they're based on the original public Instapaper rules.

    We also have PDF generation, but it's not really for crawling, and wasn't created for reading on a device like the Supernote, more for printing and reading: https://pdf.fivefilters.org/simple-print/

  • Best RSS experience?
    1 project | /r/selfhosted | 29 Aug 2021
    To accomplish full-text I ended up purchasing a license for https://www.fivefilters.org/full-text-rss/, self host it and bounce it through a docker container running Tor+privproxy which generates a new circuit every 10 minutes to help avoid IP based limits on certain websites I subscribe to. I can also disable the Tor bounce per-feed if needed.
  • The most underused browser feature
    22 projects | news.ycombinator.com | 25 Aug 2021
    Thanks for mentioning Instant View, I hadn't come across that. We actually maintain something similar here: https://github.com/fivefilters/ftr-site-config

    We use these in our own tools and also get contributions from others, including Wallabag users: https://github.com/wallabag/wallabag

    Before it was sold, Instapaper used to have something similar. A public database of its site-specific extraction templates. We used that as the starting point for our repository.

  • A 4 minute introduction to RSS
    9 projects | news.ycombinator.com | 2 Jul 2021
    If you're trying to build one yourself, have a look at the open source Readability code[1]. It was originally developed by Arc90 and is now used by Apple and Mozilla in their browser reader views. The code has been ported to a number of different languages.

    I work on a service called Full-Text RSS[2] that used a PHP port of Readability, coupled with site-specific extraction rules[3] to identify and extract article content from each feed item. It then produces a full-text version of the given feed. The idea is you subscribe to the full-text version in whichever feed reader you use and it will transparently give you full-text articles where you had partial content before.

    [1] https://github.com/mozilla/readability

    [2] https://www.fivefilters.org/full-text-rss/

    [3] https://github.com/fivefilters/ftr-site-config
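
To make the idea of site-specific extraction rules more concrete, here is a minimal sketch of how a rule file of XPath selectors might be applied to a page. It is not the Full-Text RSS implementation. The rule file, the sample page, and the helper names parse_config and extract are all made up for illustration, and the "directive: XPath" layout (title, body, strip) is an assumption about the general style of the ftr-site-config files. It uses only the Python standard library, whose XPath support is limited, so the sample page is deliberately well-formed.

```python
import xml.etree.ElementTree as ET

# Hypothetical rule file for a made-up site (illustration only).
SITE_CONFIG = """\
# example.com.txt
title: //h1
body: //div[@id='article']
strip: //div[@class='ad']
"""

# Well-formed sample page so the standard-library XML parser can read it.
PAGE = """\
<html><body>
<h1>Full-text feeds</h1>
<div id="article"><p>First paragraph.</p><div class="ad">Buy now</div><p>Second paragraph.</p></div>
</body></html>
"""


def parse_config(text):
    """Read 'directive: value' lines into a dict of lists, skipping comments."""
    rules = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition(":")
        rules.setdefault(key.strip(), []).append(value.strip())
    return rules


def to_relative(xpath):
    """ElementTree only accepts relative paths, so '//x' becomes './/x'."""
    return "." + xpath if xpath.startswith("//") else xpath


def extract(page, rules):
    """Apply strip, title and body rules and return the extracted text."""
    root = ET.fromstring(page)

    # Remove every element matched by a 'strip' rule.
    for xpath in rules.get("strip", []):
        unwanted = set(root.findall(to_relative(xpath)))
        for parent in list(root.iter()):
            for child in list(parent):
                if child in unwanted:
                    parent.remove(child)

    result = {}
    for field in ("title", "body"):
        # Use the first rule of each kind that matches something.
        for xpath in rules.get(field, []):
            node = root.find(to_relative(xpath))
            if node is not None:
                parts = (t.strip() for t in node.itertext())
                result[field] = " ".join(p for p in parts if p)
                break
    return result


if __name__ == "__main__":
    print(extract(PAGE, parse_config(SITE_CONFIG)))
```

A real extractor would need a forgiving HTML parser and full XPath support, and would fall back to a generic algorithm such as Readability when no site rule matches.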

hnrss

Posts with mentions or reviews of hnrss. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-01.
  • Ask HN: Have you reduced technical knowledge contributions?
    1 project | news.ycombinator.com | 24 Mar 2024
    That’s interesting.

    I have predictive models that can predict if a headline (w/o the rest of the article and not considering the URL) will (a) get more than 10 votes and (b) if it does get more than 10 votes will the votes/comments ratio be more than 2 (which is roughly average)

    The first model gets a ROC-AUC (see https://scikit-learn.org/stable/modules/generated/sklearn.me...) in the low 60’s (not good), the second model gets in the low 70’s (actually pretty good, though it is a heat-seeking missile for clickbait headlines), and my latest content-based recommender for RSS items gets almost 80. (I saw a paper that one system at TikTok gets about 85.)

    To do all that you need about 10,000 headlines and don’t get a lot of benefit from having more than 100,000. The ceilings on performance have more to do with the nature of the problem rather than my models: the same article can get submitted twice and get 0 votes one time and 200 the other time so it can never be as accurate as “is this an article about galactic astronomy?”

    I had it ingest the HN comments firehose and found the amount of articles was overwhelming, my YOShInOn RSS reader now ingests the “best comments” from

    https://hnrss.github.io/

    together with 110 other feeds and actually I like the comments it picks out a lot. Now that the system is adding about 3000 items per day it might be able to handle a big feed like the comments firehose since now those comments are diluted with so many quality articles. For a problem like that you might want a two-score system with: (i) is it relevant? (something I like) and (ii) is it popular? (like Google’s PageRank)

    I think you could make a model that compares comments in the best comments feed with other comments. I have tried formulating the problems above as regression problems where I try to predict the actual score and it does not work well because of the uncertainty problem but formulated as a classification problem for a score over a threshold it is easy to make a well-calibrated model that tells you “this article has a 20% chance of frontpaging” which is about the best anyone can do.

  • Ask HN: How can I get rid of addiction to HN?
    1 project | news.ycombinator.com | 17 Mar 2024
    Subscribe via RSS, so you can scratch the curiosity itch and ease the FOMO, without coming to the site all the time and looking over the same things 20 times?

    https://hnrss.github.io/

  • Show HN: Hacker News Outliers
    1 project | news.ycombinator.com | 18 Feb 2024
  • Ask HN: Is There an HN Reader and Filter?
    1 project | news.ycombinator.com | 15 Jan 2024
    https://news.ycombinator.com/item?id=9491978

    and this https://hnrss.github.io/

    ps i’m ok with some % of false positives, but hopefully a sprinkle of OpenAI could keep that magically low?

    thanks

  • Orange Site Hit
    4 projects | news.ycombinator.com | 1 Jan 2024
  • RSS can be used to distribute all sorts of information
    9 projects | news.ycombinator.com | 20 Nov 2023
    It sounds interesting but I use https://hnrss.github.io/

    Unless it had most of the features of hnrss.org I would not be able to use it.

    Perhaps you could pivot your approach and submit a PR to hnrss for the feature?

  • Ask HN: Who is hiring? (October 2023)
    9 projects | news.ycombinator.com | 2 Oct 2023
  • Tell HN: There is a new highlights page on HN
    1 project | news.ycombinator.com | 25 Sep 2023
    Looks like there's an unmerged PR on the third-party hnrss project that would add this: https://github.com/hnrss/hnrss/pull/84
  • Why your blog still needs RSS
    9 projects | news.ycombinator.com | 19 Aug 2023
    Check out the link below to get more customized, topic-wise RSS feeds.

    https://hnrss.github.io/

  • Ask HN: Is there a way to “filter” the posts on HN
    1 project | news.ycombinator.com | 25 Jul 2023
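
Several of the posts above use hnrss for filtered or topic-specific feeds. As a rough sketch of what that looks like in practice, the snippet below builds feed URLs with query-string filters. The hnrss.org base URL, the newest and bestcomments feed names, and the q, points and comments parameters are written from memory of the hnrss documentation and should be treated as assumptions to check against https://hnrss.github.io/. The helper name hnrss_url is hypothetical.

```python
from urllib.parse import urlencode

# Assumed public endpoint; verify against the hnrss documentation.
HNRSS_BASE = "https://hnrss.org"


def hnrss_url(feed="newest", **filters):
    """Build a feed URL, dropping any filter whose value is None."""
    query = urlencode({k: v for k, v in filters.items() if v is not None})
    return f"{HNRSS_BASE}/{feed}" + (f"?{query}" if query else "")


if __name__ == "__main__":
    # New stories mentioning "rss" with at least 50 points (assumed parameters).
    print(hnrss_url("newest", q="rss", points=50))
    # The "best comments" feed mentioned in one of the posts (assumed feed name).
    print(hnrss_url("bestcomments"))
```

The resulting URL can be pasted into any feed reader. Keeping the filters in the URL means the filtering happens on the server and the reader needs no extra configuration.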

What are some alternatives?

When comparing ftr-site-config and hnrss you can also consider the following projects:

tridactyl - A Vim-like interface for Firefox, inspired by Vimperator/Pentadactyl.

rss-proxy - RSS-proxy allows you to create an RSS or ATOM feed of almost any website, just by analyzing the static HTML structure.

dom-distiller - Distills the DOM

newsboat - An RSS/Atom feed reader for text terminals

rssguard - Feed reader (and podcast player) which supports RSS/ATOM/JSON and many web-based feed services.

hackernews-TUI - A Terminal UI to browse Hacker News

arc90-readability - A copy of the original Arc90 repo with links to many of the current ports.

fraidycat - Follow blogs, wikis, YouTube channels, as well as accounts on Twitter, Instagram, etc. from a single page.

SponsorBlock - Skip YouTube video sponsors (browser extension)

ALL-about-RSS - A list of RSS related stuff: tools, services, communities and tutorials, etc.

readability - A standalone version of the readability lib

Hacker News API - Documentation and Samples for the Official HN API