Top 23 Scrapy Open-Source Projects
-
crawlab
Distributed web crawler admin platform for managing spiders, regardless of language or framework.
-
scrapydweb
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.
-
webscraping-from-0-to-hero
An open project repository that aims to share knowledge and experience about web scraping with Python.
-
fakebrowser
🤖 Fake fingerprints to bypass anti-bot systems. Simulates mouse and keyboard operations so that behavior looks like a real person's.
-
kimuraframework
Kimurai is a modern web scraping framework written in Ruby that works out of the box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests, and lets you scrape and interact with JavaScript-rendered websites.
-
alltheplaces
A set of spiders and scrapers to extract location information from places that post their location on the internet.
-
Netflix-Clone
Netflix-like full-stack application with an SPA client and a backend implemented in a service-oriented architecture (by yuchiu)
Project mention: Web Scraping from 0 to hero – Sharing knowledge about web scraping on GH | news.ycombinator.com | 2023-07-06
scrapy-playwright is an integration between Scrapy and Playwright. It enables scraping dynamic web pages with Scrapy by processing the web scraping requests using a Playwright instance.
Project mention: Differentiating between hypermarkets and supermarkets. | /r/openstreetmap | 2023-12-09
Maybe a different approach? https://www.alltheplaces.xyz/ has stores grouped by name
Project mention: Tanakai: Modern web scraping framework written in Ruby | news.ycombinator.com | 2023-10-25
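As a rough illustration of the scrapy-playwright integration described above, the sketch below shows the kind of configuration its README documents: Scrapy's download handlers are pointed at the Playwright handler, the asyncio Twisted reactor is selected, and individual requests opt in through a `playwright` key in `meta`. The names `PLAYWRIGHT_SETTINGS` and `playwright_meta` are hypothetical helpers invented for this example, not part of either library.

```python
# Minimal sketch, assuming the settings documented in the scrapy-playwright README.
# PLAYWRIGHT_SETTINGS and playwright_meta are hypothetical names for illustration.

# Values that would normally live in a Scrapy project's settings.py
# (or in a spider's custom_settings).
PLAYWRIGHT_SETTINGS = {
    "DOWNLOAD_HANDLERS": {
        "http": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
        "https": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
    },
    # scrapy-playwright requires the asyncio-based Twisted reactor.
    "TWISTED_REACTOR": "twisted.internet.asyncioreactor.AsyncioSelectorReactor",
}


def playwright_meta(extra=None):
    """Build a Request.meta dict that asks scrapy-playwright to render the page."""
    meta = {"playwright": True}
    meta.update(extra or {})
    return meta
```

In a spider, a request would then be issued along the lines of `scrapy.Request(url, meta=playwright_meta())`; requests without the `playwright` key go through Scrapy's regular downloader.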
Scrapy related posts
-
Web Scraping Dynamic Websites With Scrapy Playwright
-
Differentiating between hypermarkets and supermarkets.
-
Tanakai: Modern web scraping framework written in Ruby
-
Meta, Microsoft and Amazon team up on maps project
-
Distribution of gross and net salaries on r/BESalary [OC]
-
How to make scrapy run multiple times on the same URLs?
-
There are only 2 .yahoo Internet domains
Index
What are some of the best open-source Scrapy projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | crawlab | 10,803 |
2 | scrapy-redis | 5,454 |
3 | Gerapy | 3,215 |
4 | scrapy-splash | 3,051 |
5 | scrapydweb | 3,004 |
6 | SpiderKeeper | 2,705 |
7 | webscraping-from-0-to-hero | 1,457 |
8 | advertools | 1,058 |
9 | fakebrowser | 1,048 |
10 | kimuraframework | 999 |
11 | scrapy-playwright | 837 |
12 | scrapyrt | 814 |
13 | Data-Engineering-Projects | 722 |
14 | scrapy-rotating-proxies | 705 |
15 | scrapy-fake-useragent | 681 |
16 | domains | 640 |
17 | alltheplaces | 559 |
18 | PHP Scraper | 497 |
19 | Netflix-Clone | 263 |
20 | tanakai | 260 |
21 | awesome-web-scraper | 237 |
22 | estela | 154 |
23 | GoodreadsScraper | 115 |