pybaseball
baseballr
Our great sponsors
pybaseball | baseballr | |
---|---|---|
33 | 16 | |
1,113 | 351 | |
- | - | |
5.0 | 7.4 | |
21 days ago | 12 days ago | |
Python | R | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pybaseball
- pybaseball help
- Best baseballdata source for datascraping
-
Baseball Savant API
pybaseball and MLB-StatsAPI are the go-to python wrappers for the official MLB API.
- Data sources for MLB ABs?
-
Looking for Spray Angle Data from this season (preferably)
I think pybaseball includes spray angle in their statcast() data frame! And if not, there definitely is the (x,y) location of where the ball landed, so you could probably do some trig to calculate that too.
-
Anyone willing to help create a webscrape for first pitch data?
Check out the docs here: https://github.com/jldbc/pybaseball/blob/master/docs/playerid_reverse_lookup.md
-
Python Code help
I think a lot of what you're looking to do though can be accomplished a bit easier by utilizing pre-existing Python libraries like Pybaseball.
- Help running pybaseball commands in Python
-
MLB Stats API Application time?
most folks without direct access to mlb's api scrape baseball savant's data api. packages like baseballr or pybaseball can help with this. remember, this is in the open on a trust model: no commercial use, and don't hammer the api.
-
Where to get started analyzing basic baseball metrics
And if you’re using Python, it’s pybaseball I believe. https://github.com/jldbc/pybaseball
baseballr
-
[General Discussion] Around the Horn - 12/11/23
A basic understanding of R should be enough if you install the baseball r package. From there you can scrape off of Baseball Reference or Fangraphs for custom date ranges to get stats on whatever time frame basis you would like. Then you can export/copy/whatever to excel if you want, or do the analysis right in R.
-
Are the 2023 Yankees too dependent on Judge (and maybe Stanton)? (a) Judge/Stanton Active: .562 W-L% in 16G, 4.8 R/G (b) Judge Active, Stanton IL: .636 W-L% in 33G, 5.0 R/G (c) Judge IL, Stanton Active: .438 W-L% in 16G, 3.4 R/G (d) Judge/Stanton IL: .400 W-L% in 10G, 3.5 R/G (Source: MLB Stats API)
Source: MLB Stats API via baseballr.
-
Scraping Minor League Stats?
I like this idea, too! I use baseballr all the time. It is a godsend.
-
Question on data scraping
In order to make it, I need to get every lineup from every game in the season. I am using the baseballr package to get the game_pk number. Each game has a game_pk number, and each lineup is tied to that game_pk. So I need to create a dataframe (all_games_list) for each game with its game_pk number in it, and then use the game_pk numbers to create a new dataframe (lineup_all) that contains the lineup for said game_pk.
-
Is their a stat or a program where I can see which pitchers during game deficits or leads, giving up a few runs due to walk walks, played hits, rbis? How would I go about filtering it out? I don’t mean starting pitchers or anything like that, I mean pitchers that came in one inning gave up 4 runs.
I just remembered there is also this R package: Acquiring and Analyzing Baseball Data • baseballr.
-
[Doyle] Multiple sources: The Seattle Mariners are calling up right-handed pitcher Bryce Miller. He will start Tuesday against Oakland.
To get all the data, I would suggest checking out baseballr if you are familiar with R. https://billpetti.github.io/baseballr/
-
[OC] The New MLB Pitch Clock is Fixing Baseball's Pace-of-Play Crisis
Visualization originally posted on my blog - I built the boxplot using R and ggplot2, and was fortunate to be able to use the excellent baseballr package to query MLB game information for the runtime source data!
-
Help!! Dataset required for Supervised Linear Regression | Learning purposes
baseballR (baseball)
-
MLB Stats API Application time?
most folks without direct access to mlb's api scrape baseball savant's data api. packages like baseballr or pybaseball can help with this. remember, this is in the open on a trust model: no commercial use, and don't hammer the api.
-
Where to get started analyzing basic baseball metrics
If you're using R, this is the gold standard package to use for getting baseball data. This helps you scrape data.
What are some alternatives?
MLB-StatsAPI - Python wrapper for MLB Stats API
boxball - Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.
mlbplotR - R package to easily plot MLB logos
sports.py - A simple Python package to gather live sports scores
ggplot2 - An implementation of the Grammar of Graphics in R
mara-pipelines - A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
baseballr - A package written for R focused on baseball analysis. Currently in development.
strat-o-rama - Generating plausible Strat-O-Matic cards from MLB data
tidycensus - Load US Census boundary and attribute data as 'tidyverse' and 'sf'-ready data frames in R
baseball-pi - Get the live box score, plays, and batter stats of your favorite MLB team right on your desktop.
upm - ⠕ Universal Package Manager - Python, Node.js, Ruby, Emacs Lisp.