Python Scrapy

Open-source Python projects categorized as Scrapy

Top 23 Python Scrapy Projects

  • scrapy-redis

    Redis-based components for Scrapy.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • Gerapy

    Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js

  • scrapydweb

    Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. DEMO :point_right:

  • scrapy-splash

    Scrapy+Splash for JavaScript integration

  • SpiderKeeper

    admin ui for scrapy/open source scrapinghub

  • advertools

    advertools - online marketing productivity and analysis tools

  • scrapy-playwright

    🎭 Playwright integration for Scrapy

    Project mention: Current problems and mistakes of web scraping in Python and tricks to solve them! | dev.to | 2024-08-22

    Middleware libraries are written by the community and are extending their functionality. For example, scrapy-playwright.

  • scrapyrt

    HTTP API for Scrapy spiders

  • scrapy-rotating-proxies

    use multiple proxies with Scrapy

  • scrapy-fake-useragent

    Random User-Agent middleware based on fake-useragent

  • alltheplaces

    A set of spiders and scrapers to extract location information from places that post their location on the internet.

    Project mention: AllThePlaces.xyz | news.ycombinator.com | 2024-08-19

    An open web data scraping dataset of CC 0 licenced POI, written in python with the scrapy framework.

    https://github.com/alltheplaces/alltheplaces

  • estela

    estela, an elastic web scraping cluster 🕸

  • GoodreadsScraper

    Scrape data from Goodreads using Scrapy and Selenium :books:

  • scrapy-cloudflare-middleware

    A Scrapy middleware to bypass the CloudFlare's anti-bot protection

  • scrapy-crawl-once

    Scrapy middleware which allows to crawl only new content

  • open-gov-crawlers

    Parse government documents into well formed JSON

  • scrapy-mysql-pipeline

    scrapy mysql pipeline

  • scrapeops-scrapy-sdk

    Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the box.

  • scrapingant-client-python

    ScrapingAnt API client for Python.

  • burplist

    Web crawler for Burplist, a search engine for craft beers in Singapore

  • hltv-scraping

    Scraping data from hltv.org

  • nse-stock-scraper

    This is Web Scraper utilizing Scrapy Framework, MongoDB and AfricasTalking to get stock prices for companies listed on the Nairobi Stock Exchange. This project will store ticker name and price as well notify via SMS once properly setup via AfricasTalking.

  • scrapy-folder-tree

    A scrapy pipeline which stores files using folder trees.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Scrapy discussion

Log in or Post with

Python Scrapy related posts

  • AllThePlaces.xyz

    2 projects | news.ycombinator.com | 19 Aug 2024
  • Announcing Crawlee Python: Now you can use Python to build reliable web crawlers

    4 projects | dev.to | 9 Jul 2024
  • Web Scraping Dynamic Websites With Scrapy Playwright

    1 project | dev.to | 6 Mar 2024
  • Differentiating between hypermarkets and supermarkets.

    1 project | /r/openstreetmap | 9 Dec 2023
  • Meta, Microsoft and Amazon team up on maps project

    1 project | news.ycombinator.com | 26 Jul 2023
  • Distribution of gross and net salaries on r/BESalary [OC]

    1 project | /r/BESalary | 1 Jul 2023
  • How to make scrapy run multiple times on the same URLs?

    2 projects | /r/scrapy | 26 Jun 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 5 Dec 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Scrapy projects in Python? This list will help you:

Project Stars
1 scrapy-redis 5,543
2 Gerapy 3,359
3 scrapydweb 3,173
4 scrapy-splash 3,153
5 SpiderKeeper 2,738
6 advertools 1,159
7 scrapy-playwright 1,035
8 scrapyrt 837
9 scrapy-rotating-proxies 738
10 scrapy-fake-useragent 686
11 alltheplaces 636
12 estela 174
13 GoodreadsScraper 129
14 scrapy-cloudflare-middleware 105
15 scrapy-crawl-once 79
16 open-gov-crawlers 66
17 scrapy-mysql-pipeline 49
18 scrapeops-scrapy-sdk 37
19 scrapingant-client-python 36
20 burplist 13
21 hltv-scraping 10
22 nse-stock-scraper 10
23 scrapy-folder-tree 9

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com