web-poet
Web scraping Page Objects core library (by scrapinghub)
scrapy-fake-useragent
Random User-Agent middleware based on fake-useragent (by alecxe)
web-poet | scrapy-fake-useragent | |
---|---|---|
1 | 3 | |
89 | 681 | |
- | - | |
8.7 | 2.3 | |
about 2 months ago | 8 months ago | |
Python | Python | |
BSD 3-clause "New" or "Revised" License | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
web-poet
Posts with mentions or reviews of web-poet.
We have used some of these posts to build our list of alternatives
and similar projects.
-
Is there a method to web scrape similar type of information from hundreds of websites with a single code or application?
Check out the web-poet pattern: https://github.com/scrapinghub/web-poet
scrapy-fake-useragent
Posts with mentions or reviews of scrapy-fake-useragent.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-10-14.
-
Looking for suggestions for a web scraper
User-Agents: Your user-agent list is pretty small, and you aren't adding the other headers that real browsers typically have. For a bigger list of user-agents you could use the scrapy-fake-user-agent middleware.
-
Apple AppStore Apps Dataset with 1.2 million apps
Use the following config Scrapy + https://github.com/aivarsk/scrapy-proxies + https://github.com/alecxe/scrapy-fake-useragent with a free random proxy list but beware of securing your database since (MongoDB) like are prone to ransomware attacks
What are some alternatives?
When comparing web-poet and scrapy-fake-useragent you can also consider the following projects:
dude - dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators
scrapy-playwright - 🎠Playwright integration for Scrapy