weibo-scraper
scrapyrt
weibo-scraper | scrapyrt | |
---|---|---|
1 | 3 | |
93 | 817 | |
- | 0.4% | |
0.0 | 6.8 | |
about 2 months ago | 3 months ago | |
Python | Python | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
weibo-scraper
-
New User - Weibo scraper
I managed to set up everything thanks to this video https://www.youtube.com/watch?v=0DJQy8VKmMU&ab_channel=KameronKales and I am trying to use this https://github.com/Xarrow/weibo-scraper . My aim would be to scrap content based on keywords or hashtags and in a given period of time. I imagine I need some string of code as input, but I have no idea how.
scrapyrt
- New to python and scrapy stuff but need this project to work so that I can do my data research and stuff easily in the future.
-
Scrap data and create a Rest API
Alternatively if you want to use scrapy there's a brilliant API addition called scrapyRT which wraps http API on your scrapy project.
-
Scraping name and location info from Linkedin Profile URL using Apps scripts
Put ScrapyRT in place to expose the scraper via web service
What are some alternatives?
myGPTReader - A community-driven way to read and chat with AI bots - powered by chatGPT.
twisted-iocpsupport - `twisted-iocpsupport` is an extension module for the Twisted `iocp` reactor to use the Windows I/O Completion Ports (IOCP) networking API. You should not need to install it directly or interact with its API; it is a dependency of Twisted on Windows platforms.
weibo-crawler - 新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频
scrapy-proxycrawl-middleware - Scrapy middleware interface to scrape using ProxyCrawl proxy service
newspaper - newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
cryptoCMD - Cryptocurrency historical price data library in Python. Data from https://coinmarketcap.com.
Douyin_TikTok_Download_API - 🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
google-play-scraper - Google play scraper for Python inspired by <facundoolano/google-play-scraper>
courlan - Clean, filter and sample URLs to optimize data collection – includes spam, content type and language filters
jarchive-clues - Web crawler to collect Jeopardy! clues from https://j-archive.com
amazon_price_tracker - A cool Scrapy spider that notifies price drop in a product you crave to buy!
newspaperjs - News extraction and scraping. Article Parsing