Rcrawler
An R web crawler and scraper (by salimk)
imdb
Web-scraping and data visualization of IMDb's most popular movies in 2018 (by lorenanda)
Our great sponsors
Rcrawler | imdb | |
---|---|---|
2 | 1 | |
344 | 0 | |
- | - | |
0.0 | 0.0 | |
about 2 years ago | over 3 years ago | |
R | R | |
GNU General Public License v3.0 or later | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Rcrawler
Posts with mentions or reviews of Rcrawler.
We have used some of these posts to build our list of alternatives
and similar projects.
-
Can R do recursive web crawling?
According to their GitHub page, it should be able to:
-
increasing scraping speed
I'm not an R expert, but here are a few links about concurrent web scraping in R: RCurl, rvest + furrr, RCrawler.
imdb
Posts with mentions or reviews of imdb.
We have used some of these posts to build our list of alternatives
and similar projects.
-
Web-scraping IMDb with R
There are many no-code tools for web scraping, like browser plug-ins (e.g. Webscraper) and software (e.g. Parsehub). However, if you need more advanced scraping settings and have basic coding skills, I recommend the Python libraries Beautiful Soup or Selenium, and the R package rvest. The latter is the one I used for scraping IMDb and you can find the commented code on my GitHub.
What are some alternatives?
When comparing Rcrawler and imdb you can also consider the following projects:
r-web-scraping-cheat-sheet - Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.
scrapingant-client-python - ScrapingAnt API client for Python.
crypto - Cryptocurrency Historical Market Data R Package
polite - Be nice on the web
RedditExtractor - A minimalistic R wrapper for the Reddit API