baseballr
tidycensus
baseballr | tidycensus | |
---|---|---|
16 | 13 | |
353 | 625 | |
- | - | |
7.4 | 7.5 | |
21 days ago | 14 days ago | |
R | R | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
baseballr
-
[General Discussion] Around the Horn - 12/11/23
A basic understanding of R should be enough if you install the baseball r package. From there you can scrape off of Baseball Reference or Fangraphs for custom date ranges to get stats on whatever time frame basis you would like. Then you can export/copy/whatever to excel if you want, or do the analysis right in R.
-
Are the 2023 Yankees too dependent on Judge (and maybe Stanton)? (a) Judge/Stanton Active: .562 W-L% in 16G, 4.8 R/G (b) Judge Active, Stanton IL: .636 W-L% in 33G, 5.0 R/G (c) Judge IL, Stanton Active: .438 W-L% in 16G, 3.4 R/G (d) Judge/Stanton IL: .400 W-L% in 10G, 3.5 R/G (Source: MLB Stats API)
Source: MLB Stats API via baseballr.
-
Scraping Minor League Stats?
I like this idea, too! I use baseballr all the time. It is a godsend.
-
Question on data scraping
In order to make it, I need to get every lineup from every game in the season. I am using the baseballr package to get the game_pk number. Each game has a game_pk number, and each lineup is tied to that game_pk. So I need to create a dataframe (all_games_list) for each game with its game_pk number in it, and then use the game_pk numbers to create a new dataframe (lineup_all) that contains the lineup for said game_pk.
-
Is their a stat or a program where I can see which pitchers during game deficits or leads, giving up a few runs due to walk walks, played hits, rbis? How would I go about filtering it out? I don’t mean starting pitchers or anything like that, I mean pitchers that came in one inning gave up 4 runs.
I just remembered there is also this R package: Acquiring and Analyzing Baseball Data • baseballr.
-
[Doyle] Multiple sources: The Seattle Mariners are calling up right-handed pitcher Bryce Miller. He will start Tuesday against Oakland.
To get all the data, I would suggest checking out baseballr if you are familiar with R. https://billpetti.github.io/baseballr/
-
[OC] The New MLB Pitch Clock is Fixing Baseball's Pace-of-Play Crisis
Visualization originally posted on my blog - I built the boxplot using R and ggplot2, and was fortunate to be able to use the excellent baseballr package to query MLB game information for the runtime source data!
-
Help!! Dataset required for Supervised Linear Regression | Learning purposes
baseballR (baseball)
-
MLB Stats API Application time?
most folks without direct access to mlb's api scrape baseball savant's data api. packages like baseballr or pybaseball can help with this. remember, this is in the open on a trust model: no commercial use, and don't hammer the api.
-
Where to get started analyzing basic baseball metrics
If you're using R, this is the gold standard package to use for getting baseball data. This helps you scrape data.
tidycensus
-
US county names dataset?
You could use the tidycensus package to get the information you need
- ACS Data in easily Digestable Format
-
Help!! Dataset required for Supervised Linear Regression | Learning purposes
the census (accessible using tidycensus or you can download bulk data straight from their website)
-
People who live near other people vote for Democrats
Data sources: Minnesota Secretary of State website, American Community Survey via tidycensus
- A blog post on learning R for spatial data science
-
Anyone know if there's a US Time Zones by Zip Code DB out there
If you happen to use R the tidycensus package is absolutely fantastic.
-
(MAPS) There are about 1,800 Ukrainian-born Hoosiers and 8,000 Hoosiers with Ukrainian ancestry
Thanks! I used the tidycensus package in RStudio https://walker-data.com/tidycensus/
-
Finding Usable US Census Data
R has a few packages for this and the first that comes to mind is tidycensus (https://walker-data.com/tidycensus/).
-
NJ Population mapping by block group
If you have some facility with R, then the 'tidycensus' package is your answer here. You can get the latest ACS (2019) data at the block group level with that. Alternatively, NHGIS from the IPUMS folks offers access to census tabular and spatial data in a straightforward format.
-
SF object pivot/spread results in improperly pivoted object (NA values)
This indicates the polygons are different - I would ASSUME that the shape for a US county does not change too much (or ever) so I feel like its an issue with the package [tidycensus](https://walker-data.com/tidycensus/).
What are some alternatives?
pybaseball - Pull current and historical baseball statistics using Python (Statcast, Baseball Reference, FanGraphs)
latlong - The latlong package maps from a latitude and longitude to a timezone.
boxball - Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.
sf - Simple Features for R
mlbplotR - R package to easily plot MLB logos
r4ds - R for data science: a book
ggplot2 - An implementation of the Grammar of Graphics in R
dplyr - dplyr: A grammar of data manipulation
baseballr - A package written for R focused on baseball analysis. Currently in development.
upm - ⠕ Universal Package Manager - Python, Node.js, Ruby, Emacs Lisp.
hoopR - An R package to quickly obtain clean and tidy men's basketball play by play data.