scrapy-fake-useragent VS scrapy-rotating-proxies

Compare scrapy-fake-useragent vs scrapy-rotating-proxies and see what are their differences.

scrapy-fake-useragent

Random User-Agent middleware based on fake-useragent (by alecxe)

scrapy-rotating-proxies

use multiple proxies with Scrapy (by TeamHG-Memex)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
scrapy-fake-useragent scrapy-rotating-proxies
3 4
681 705
- 0.0%
2.3 0.0
8 months ago almost 2 years ago
Python Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

scrapy-fake-useragent

Posts with mentions or reviews of scrapy-fake-useragent. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-10-14.
  • Looking for suggestions for a web scraper
    1 project | /r/learnpython | 1 Sep 2022
    User-Agents: Your user-agent list is pretty small, and you aren't adding the other headers that real browsers typically have. For a bigger list of user-agents you could use the scrapy-fake-user-agent middleware.
  • Apple AppStore Apps Dataset with 1.2 million apps
    6 projects | /r/datasets | 14 Oct 2021
    Use the following config Scrapy + https://github.com/aivarsk/scrapy-proxies + https://github.com/alecxe/scrapy-fake-useragent with a free random proxy list but beware of securing your database since (MongoDB) like are prone to ransomware attacks

scrapy-rotating-proxies

Posts with mentions or reviews of scrapy-rotating-proxies. We have used some of these posts to build our list of alternatives and similar projects.
  • How do you handle CAPTCHA pages appearing in some of the rotating proxies you use?
    1 project | /r/webscraping | 13 Apr 2023
    It was the sliding CAPTCHA but I solved it by following the instructions from the library I'm using to rotate proxies to retry with a different IP when there is a CAPTCHA https://github.com/TeamHG-Memex/scrapy-rotating-proxies At the bottom if anyone is interested
  • Scrapy rotating proxies
    1 project | /r/webscraping | 1 Aug 2022
    Hi, I've been using the scrapy-rotating-proxies (https://github.com/TeamHG-Memex/scrapy-rotating-proxies) library for scrapy and I'm getting logs in my crawl of type example: "[rotating_proxies.expire] DEBUG: Proxy is DEAD. When I check and test the proxies (I'm using webshare proxies) and urls mentioned on the logs individually they work ok, so I assume it's a problem with the library, has anyone had the same issue of similar problem? (I looked for tickets reported on github but had didn't find any refering to this.
  • how does one configure webshare api key in scrapy scripts and also to use scrapy-proxy-pool?
    1 project | /r/scrapy | 21 Dec 2021
    Scrapy takes the proxy from the http_proxy/https_proxy env vars. They can include the user/password. As for pools, Scrapy itself doesn't support that, but you can use https://github.com/TeamHG-Memex/scrapy-rotating-proxies or similar addons to use them.
  • Using free proxies for a spider.
    1 project | /r/scrapy | 2 Jul 2021
    Hello, I'm looking into trying free proxies using something like in this github (https://github.com/TeamHG-Memex/scrapy-rotating-proxies/blob/master/README.rst). However, I need to find my own list of proxy IP's to use. When I look up free proxies I find plenty of options, but I'm rather new to this topic and don't know what to use. There seems to be plenty of different types, and I'm not sure if I should/shouldn't use certain proxy IP's. Any advice on the topic would be appreciated.

What are some alternatives?

When comparing scrapy-fake-useragent and scrapy-rotating-proxies you can also consider the following projects:

scrapy-playwright - 🎭 Playwright integration for Scrapy

scrapy-splash - Scrapy+Splash for JavaScript integration

scrapy-cloudflare-middleware - A Scrapy middleware to bypass the CloudFlare's anti-bot protection

WikiMapper - Create maps of wiki links on how they interconnect with each other.

hltv-scraping - Scraping data from hltv.org

Shadowrocket-ADBlock-Rules - 提供多款 Shadowrocket 规则,带广告过滤功能。用于 iOS 未越狱设备选择性地自动翻墙。

viviner - 🍷 Scraps data from Vivino and collects outstanding wine-based meta-data.

scrapy-crawl-once - Scrapy middleware which allows to crawl only new content

webscraping-from-0-to-hero - The web scraping open project repository aims to share knowledge and experiences about web scraping with Python

scrapydweb - Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. DEMO :point_right: