imdb
Web-scraping and data visualization of IMDb's most popular movies in 2018 (by lorenanda)
Rcrawler
An R web crawler and scraper (by salimk)
| | imdb | Rcrawler |
|---|---|---|
| Mentions | 1 | 2 |
| Stars | 0 | 344 |
| Growth | - | - |
| Activity | 0.0 | 0.0 |
| Last commit | over 3 years ago | about 2 years ago |
| Language | R | R |
| License | MIT License | GNU General Public License v3.0 or later |
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
imdb
Posts with mentions or reviews of imdb.
We have used some of these posts to build our list of alternatives
and similar projects.
- Web-scraping IMDb with R
There are many no-code tools for web scraping, like browser plug-ins (e.g. Webscraper) and software (e.g. Parsehub). However, if you need more advanced scraping settings and have basic coding skills, I recommend the Python libraries Beautiful Soup or Selenium, and the R package rvest. The latter is the one I used for scraping IMDb and you can find the commented code on my GitHub.
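As a rough illustration of the rvest approach mentioned in this post, here is a minimal sketch. It is not the author's code: the HTML snippet, the class names (`movie`, `title`, `rating`) and the placeholder titles are all invented for the example, and IMDb's real markup will differ. For a live page, you would pass a URL to `read_html()` instead of a string.

```r
library(rvest)

# Hypothetical stand-in for a downloaded page; the markup and class
# names are invented and do not reflect IMDb's actual HTML
html <- '<html><body>
<div class="movie"><h3 class="title">Movie A</h3><span class="rating">7.9</span></div>
<div class="movie"><h3 class="title">Movie B</h3><span class="rating">6.4</span></div>
</body></html>'

# Parse the HTML, then select one node per movie with a CSS selector
page <- read_html(html)
movies <- html_elements(page, "div.movie")

# html_element() returns exactly one match per movie node, so the
# two columns stay aligned even if a field is missing (it becomes NA)
movie_table <- data.frame(
  title = html_text2(html_element(movies, "h3.title")),
  rating = as.numeric(html_text2(html_element(movies, "span.rating"))),
  stringsAsFactors = FALSE
)

print(movie_table)
```

Selecting the container nodes first and then extracting one field per container is a common way to keep rows consistent when some entries lack a value.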
Rcrawler
Posts with mentions or reviews of Rcrawler.
We have used some of these posts to build our list of alternatives
and similar projects.
- Can R do recursive web crawling?
According to their GitHub page, it should be able to:
- increasing scraping speed
I'm not an R expert, but here are a few links about concurrent web scraping in R: RCurl, rvest + furrr, Rcrawler.
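The "rvest + furrr" combination mentioned in this post could look roughly like the sketch below. It is only an illustration under stated assumptions: the inline HTML strings stand in for real pages so the example runs offline, the helper name `extract_heading` is hypothetical, and the worker count of 2 is arbitrary. In real use, the vector would hold URLs and each worker would download and parse its own page.

```r
library(rvest)
library(furrr)

# Start two background R sessions; the worker count is an arbitrary choice
future::plan(future::multisession, workers = 2)

# Hypothetical stand-ins for pages; in real use these would be URLs
pages <- c(
  "<html><body><h1>Page one</h1></body></html>",
  "<html><body><h1>Page two</h1></body></html>"
)

# Parse inside the worker and return plain text, because parsed
# xml2 documents cannot be passed between R sessions
extract_heading <- function(html) {
  doc <- rvest::read_html(html)
  rvest::html_text2(rvest::html_element(doc, "h1"))
}

# future_map_chr() mirrors purrr::map_chr() but runs calls in parallel
headings <- future_map_chr(pages, extract_heading)
print(headings)

# Return to sequential execution and release the workers
future::plan(future::sequential)
```

Parallel requests put more load on the target site, so it is worth combining this with rate limiting when scraping real pages.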
What are some alternatives?
When comparing imdb and Rcrawler you can also consider the following projects:
r-web-scraping-cheat-sheet - Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.
scrapingant-client-python - ScrapingAnt API client for Python.
crypto - Cryptocurrency Historical Market Data R Package
polite - Be nice on the web
RedditExtractor - A minimalistic R wrapper for the Reddit API