requests-html vs awesome-web-scraping

requests-html

Pythonic HTML Parsing for Humans™ (by kennethreitz)

Source Code

html.python-requests.org

Suggest alternative

Edit details

awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing. (by lorien)

web-scraping captcha-bypass captcha-recaptcha Crawling crawling-framework crawling-python crawling-tool Scraping scraping-framework scraping-python scraping-tool Webscraping Crawler Spider

Source Code

Suggest alternative

Edit details

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

requests-html		awesome-web-scraping
	Project
2	Mentions	6
266	Stars	6,308
-	Growth	-
0.0	Activity	5.1
almost 2 years ago	Latest Commit	17 days ago
	Language	Makefile
MIT License	License	GNU General Public License v3.0 or later

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

requests-html

Posts with mentions or reviews of requests-html. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-05-23.

Which string to lower case method to you use?
2 projects | /r/Python | 23 May 2022

Example: requests-html which has a rather exhaustive README.md, but their dedicated page is not that helpful, if I remember correctly, and currently the domain is suspended.
Problem reaching a link hidden deeply in the html
1 project | /r/webscraping | 14 Jun 2021

You can get through this by using requests_html to render the full page before trying to reach this url (Selenium works too but is even heavier).

awesome-web-scraping

Posts with mentions or reviews of awesome-web-scraping. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-03-09.

Ask HN: LinkedIn sent me a cease and desist for my Chrome extension. Help?
1 project | news.ycombinator.com | 31 Jan 2023

>I can scrape linkedin with a python script. That doesn't mean linkedin can shut down python.
Well said!
Also, what about copy-and-paste? The last time I checked, data could be highlighted in the browser, copied, and pasted...
Does that mean that LinkedIn can shut down the copy-and-paste capability of your browser and/or operating system?
What about "Save Page As..." functionality (the ability of a browser to save a page offline?)
Can LinkedIn shut down "Save Page As..." ?
Also, what about the Print Screen (take a screen snapshot) capabilities of your operating system?
Can LinkedIn shut down that?
Finally, there's literally oodles of software that can be used for web scraping; what follows below is just one non-canonical list:
https://github.com/lorien/awesome-web-scraping
Is LinkedIn going to shut down all of that, at the same time?
Anyway, an excellent point about Python!
Awesome-web-scraping – List of libraries, tools and APIs for web scraping
1 project | news.ycombinator.com | 31 Jan 2023
How does webscraping a website work and putting the data into my website?
1 project | /r/webscraping | 26 Jun 2022

Because at least for the scraping part there are open-source and paid services that will probably get you the data today if you need it (unless these are some really hard-to-scrape websites you're targeting) But if you are keen on learning yourself just scroll down this subreddit you will find many guides users shared along the years...
Russian Flag in Readme
2 projects | news.ycombinator.com | 9 Mar 2022

E.g. how would a Ukrainian dev feel having his project showcased in this list, under the Russian flag?
[0] https://github.com/lorien/awesome-web-scraping/issues/136
A central repository for scrapping scripts
1 project | /r/webscraping | 22 Feb 2021

What are some alternatives?

When comparing requests-html and awesome-web-scraping you can also consider the following projects:

requests-html - Pythonic HTML Parsing for Humans™

proxy-list - A list of free, public, forward proxy servers. UPDATED DAILY!

croncert-config - configuration and github actions for concertcloud.live (fka croncert.ch), a website that shows you concerts in various cities

Proxyman - Modern. Native. Delightful Web Debugging Proxy for macOS, iOS, and Android ⚡️

html2rss - 📰 Build RSS 2.0 feeds from websites (and JSON APIs) with a few CSS selectors.

awesome-micropython - A curated list of awesome MicroPython libraries, frameworks, software and resources.

TabNine - AI Code Completions

Awesome-Warez - All your base are belong to us!

syntax-highlighter - Syntax Highlighter extension for Visual Studio Code (VSCode). Based on Tree-sitter.

cookiecutter-poetry-pypackage - Cookiecutter template for poetry managed python package

bookmarks - :bookmark: :star: Collection of public dev bookmarks, shared with :heart: from www.codever.dev

2captcha-java - Java library for easy integration with the API of 2captcha captcha solving service to bypass recaptcha, hcaptcha, funcaptcha, geetest and solve any other captchas.

requests-html vs requests-html awesome-web-scraping vs proxy-list requests-html vs croncert-config awesome-web-scraping vs Proxyman requests-html vs html2rss awesome-web-scraping vs awesome-micropython awesome-web-scraping vs TabNine awesome-web-scraping vs Awesome-Warez awesome-web-scraping vs syntax-highlighter awesome-web-scraping vs cookiecutter-poetry-pypackage awesome-web-scraping vs bookmarks awesome-web-scraping vs 2captcha-java

Compare requests-html vs awesome-web-scraping and see what are their differences.

requests-html

awesome-web-scraping

requests-html

awesome-web-scraping

What are some alternatives?