scrapy-fake-useragent
viviner
scrapy-fake-useragent | viviner | |
---|---|---|
3 | 1 | |
689 | 50 | |
- | - | |
2.3 | 2.4 | |
over 1 year ago | over 1 year ago | |
Python | Python | |
MIT License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
scrapy-fake-useragent
-
Looking for suggestions for a web scraper
User-Agents: Your user-agent list is pretty small, and you aren't adding the other headers that real browsers typically have. For a bigger list of user-agents you could use the scrapy-fake-user-agent middleware.
-
Apple AppStore Apps Dataset with 1.2 million apps
Use the following config Scrapy + https://github.com/aivarsk/scrapy-proxies + https://github.com/alecxe/scrapy-fake-useragent with a free random proxy list but beware of securing your database since (MongoDB) like are prone to ransomware attacks
viviner
-
For Wine Lovers - I scraped wine data from Everly Wine Shop and compared it with Vivino ratings
Used python to scrape data from Everly (https://wineshop.theeverly.ca/), and used this person's Vivino data scraper to cross reference. https://github.com/gugarosa/viviner
What are some alternatives?
scrapy-playwright - 🎭 Playwright integration for Scrapy
autoscraper - A Smart, Automatic, Fast and Lightweight Web Scraper for Python
scrapy-rotating-proxies - use multiple proxies with Scrapy
WikiMapper - Create maps of wiki links on how they interconnect with each other.
scrapy-splash - Scrapy+Splash for JavaScript integration
yars - Yet Another Reddit Scrapper (without API keys) | Scrap search results, posts and images from subreddits filtered by hot, new etc and bulk download any user's data.
hltv-scraping - Scraping data from hltv.org
Webtap.ai - AI web scraping python library for efficient and reliable web scraping.
scrapy-iltasanomat-kuntavaalit2021 - Fetch Sanoma kuntavaalit 2021 data
google-finance-py - Scripts for scraping Google Finance data in Python.
fareview - A simple market price monitor for commercial beers in Singapore
GoodreadsScraper - Scrape data from Goodreads using Scrapy and Selenium :books: