requests-html VS feedparser

Compare requests-html and feedparser and see how they differ.

              requests-html      feedparser
Mentions      14                 7
Stars         13,806             2,096
Growth        0.2%               1.9%
Activity      0.0                7.6
Last commit   about 1 year ago   23 days ago
Language      Python             Python
License       MIT License        BSD 2-clause "Simplified" License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

requests-html

Posts with mentions or reviews of requests-html. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-13.

feedparser

Posts with mentions or reviews of feedparser. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-11-12.
  • What I Wish Someone Told Me About Postgres
    5 projects | news.ycombinator.com | 12 Nov 2024
    I am using the feedparser library in Python https://github.com/kurtmckee/feedparser/ which basically takes an RSS URL and standardizes it to a reasonable extent. But I have noticed that different websites still get parsed slightly differently. For example, look at how https://beincrypto.com/feed/ has a long description (containing actual HTML) inside, but this website https://www.coindesk.com/arc/outboundfeeds/rss/ completely cuts the description out. I have about 50 such websites and they all have slight variations. So you are saying that, in addition to storing the parsed data (title, summary, content, author, pubdate, link, guid) that I currently store, I should also add an xml column and store the raw XML from each URL until I get a good hang of how each site differs?
  • RSS can be used to distribute all sorts of information
    9 projects | news.ycombinator.com | 20 Nov 2023
    There is JSON Feed¹ already. One of the spec writers is behind micro.blog, which is the first place I saw it (and also one of the few places I've seen it). I don't think it is a bad idea, and it doesn't take all that long to implement it.

    I have long hoped it would pick up with the JSON-ify everything crowd, just so I'd never see a non-Atom feed again. We perhaps wouldn't need sooo much of the magic that is wrapped up in packages like feedparser² to deal with all the brokenness of RSS in the wild then.

    ¹ https://www.jsonfeed.org/

    ² https://github.com/kurtmckee/feedparser

  • Help! trying to use scraping for my dissertation but I am clueless
    1 project | /r/webscraping | 6 Jul 2023
    What sites did you try? Looked into RSS yet? Many sites have RSS feeds you can use with something like https://github.com/kurtmckee/feedparser nytimes.com feeds: https://www.nytimes.com/rss
  • Newb learning GitHub & Python. Projects?
    4 projects | /r/github | 22 Jan 2023
    feedparser
  • Python Library to scrape RSS-Feeds from waybackmachine?
    1 project | /r/webscraping | 28 Dec 2022
    You can explore FeedParser too
  • looking for a project
    3 projects | /r/github | 22 Dec 2022
    feedparser is a Python package for receiving and parsing RSS/Atom newsfeeds. The maintainer is active but really needs much more support.
  • Question from an absolute beginner
    2 projects | /r/PythonEspanol | 21 Feb 2022
    The simplest way I know to monitor YouTube channels is the RSS feed that each channel has. The format is https://www.youtube.com/feeds/videos.xml?channel_id=[CHANNEL_ID]. If you are not familiar with RSS, take a look at the wiki. To read RSS feeds in Python you have feedparser (and surely many others).
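
Two of the posts above describe concrete feedparser workflows: keeping the raw XML next to the normalized fields while you learn how each site differs, and polling the per-channel YouTube feed. The following is only a minimal sketch of how the two could fit together, not a recommended schema. The table names, the helper names (`youtube_feed_url`, `entry_to_row`, `store_feed`, `fetch_and_store`), and the choice of SQLite are assumptions made for illustration. The entry keys (`id`, `title`, `summary`, `author`, `published`, `link`) are the normalized names feedparser uses for entries, and any of them may be missing for a given site, which is why every lookup goes through `.get()`.

```python
import sqlite3
from urllib.parse import urlencode

# Feed URL format quoted in the last post above.
YOUTUBE_FEED_BASE = "https://www.youtube.com/feeds/videos.xml"


def youtube_feed_url(channel_id):
    """Build the per-channel feed URL for a YouTube channel ID."""
    return f"{YOUTUBE_FEED_BASE}?{urlencode({'channel_id': channel_id})}"


def init_db(conn):
    """Create one table for raw feed bodies and one for normalized entries."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS raw_feeds ("
        "feed_url TEXT PRIMARY KEY, raw_xml TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS entries ("
        "feed_url TEXT, guid TEXT, title TEXT, summary TEXT, "
        "author TEXT, published TEXT, link TEXT, "
        "PRIMARY KEY (feed_url, guid))"
    )


def entry_to_row(feed_url, entry):
    """Flatten one parsed entry; missing fields become None.

    `entry` only needs a dict-style .get(), which feedparser entries provide.
    """
    return (
        feed_url,
        entry.get("id") or entry.get("link"),  # fall back to the link if no id
        entry.get("title"),
        entry.get("summary"),
        entry.get("author"),
        entry.get("published"),
        entry.get("link"),
    )


def store_feed(conn, feed_url, entries, raw_xml):
    """Store the raw XML once per feed and one row per entry."""
    conn.execute(
        "INSERT OR REPLACE INTO raw_feeds VALUES (?, ?)", (feed_url, raw_xml)
    )
    rows = [entry_to_row(feed_url, entry) for entry in entries]
    conn.executemany(
        "INSERT OR REPLACE INTO entries VALUES (?, ?, ?, ?, ?, ?, ?)", rows
    )
    conn.commit()
    return len(rows)


def fetch_and_store(conn, feed_url):
    """Download a feed, parse it with feedparser, and keep both forms."""
    import urllib.request

    import feedparser  # third-party: pip install feedparser

    with urllib.request.urlopen(feed_url, timeout=30) as response:
        raw = response.read()
    parsed = feedparser.parse(raw)  # parse() also accepts raw bytes or text
    return store_feed(
        conn, feed_url, parsed.entries, raw.decode("utf-8", errors="replace")
    )
```

A call such as `fetch_and_store(conn, youtube_feed_url(channel_id))` would then poll one channel. Keeping the raw body in its own table means a site whose description or content is parsed unexpectedly can be re-examined later without fetching it again.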

What are some alternatives?

When comparing requests-html and feedparser you can also consider the following projects:

Scrapy - Scrapy, a fast high-level web crawling & scraping framework for Python.

pyspider - A Powerful Spider(Web Crawler) System in Python.

MechanicalSoup - A Python library for automating interaction with websites.

RoboBrowser
