boxball
beisbol-analitica
boxball | beisbol-analitica | |
---|---|---|
7 | 10 | |
110 | 23 | |
- | - | |
5.5 | 0.0 | |
5 months ago | about 1 year ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
boxball
-
Importing Retrosheet to Tableau or Power BI
I haven't worked with BI tools, but I'll make the same recommendation for Retrosheet data that I always do: check out the Boxball distributions, specifically the CSV files (or the parquet files if you prefer). You can load those right in, rather than futzing with Retrosheet's event files and processing them manually, etc., just to get to the same spot.
-
Delta Aging Curve Python
The original data source is the Baseball Databank (aka "The Lahman Database"). I use the Boxball distributions, which include both retrosheet and Baseball Databank data.
- What's the current standard for getting mlb data into a database? I used to use Baseball On a Stick (which accessed the old gameday data) but that doesn't work now. I found "mlbdata" which accesses the mlb API but I can't figure out how to make it put stuff into a database. Is there a good option?
-
How do I compile a list of a team's games where event x did not happen?
You know, I thought about that after posting, that the dailies might not be available directly from retrosheet. I use the Boxball distributions, which take the retrosheet data and process it into database images. These distributions include a "daily" table with the daily logs. So apparently Boxball is generating that table.
-
Aggregate Game Logs
I used to think the same thing, and then I found the Boxball pre-built images. You still have to be a bit handy to get them working, but it's way easier than starting from scratch with raw retrosheet data. I initially used the postgres image and worked in SQL, and then switched to python/pandas and the parquet files.
-
Finding a player's stats through N games
I've always been good with data, math, Excel, etc., but decided to take it to another level during the pandemic. The most difficult thing about Retrosheet data is getting into a usable format. Luckily, the boxball project has created a number of ready-to-download images with all of the data, for different database technologies.
-
RE24 Data for wOBA Calculations
For Retrosheet data, I've been using pre-built images from Boxball, rather than going through the whole process of downloading and converting the files from retrosheet. The Boxball images are awesome. I mostly use the Parquet files for Pandas, but they have other formats (e.g., postgres Docker images, CSV, etc) there.
beisbol-analitica
-
I'd like to find a database that contains all-time player data, broken by season, for the Mexican and/or Korean pro leagues. Anyone know where to find that? Thanks!
You should check out the Beisbol Analitica project: https://github.com/juanitobanca/beisbol-analitica
-
Beisbol Analitica is now available in SQLite3
If you want to Collab, ping us over Github(https://github.com/andres-alvarado/beisbol-analitica/) or Twitter(http://www.twitter.com/juanitobanca). If you have feedback for us, please do let us know, we appreciate all comments.
-
Beisbol Analitica - The Open Source Baseball Database for MLB and LatAm v3 is ready for download
We have a bunch of features we want to work on. Please visit https://github.com/andres-alvarado/beisbol-analitica/issues for more information.
- Code For Linear Weight/ Run Value
-
New to Sabermetrics
If you are looking for databases, Beisbol Analitica has pitch by pitch, play by play data as well as aggregated metrics per game, season, team, etc for winter leagues and MLB: https://github.com/andres-alvarado/beisbol-analitica
-
Almost figured out player splits for "by Venue" endpoint.
Baseball analítica easily let’s u create a lot of different splits please check line 66: https://github.com/andres-alvarado/beisbol-analitica/blob/main/data/commons/procedimientos/master_procedure.sql
-
A need for a comprehensive Teams/Franchise Name and ID directory (Any suggestions?)
Béisbol analítica: https://github.com/andres-alvarado/beisbol-analitica
-
Question About Linear Weights
Check lines 60 to 105 of this code: https://github.com/andres-alvarado/beisbol-analitica/blob/main/data/run_expectancy/procedimientos/rem_event_run_value.sql . I basically get the value of an out. Then you can use that to calculate wOBA in the scale of OBP.
-
Trying to figure out how to query the MLB API with baseballR for multiple days in 1 query
Hi, looks like you are trying to clean tons of seasons of data. I have already done that for you. Please visit https://github.com/andres-alvarado/beisbol-analitica to download the mysql database.
-
RE24 Data for wOBA Calculations
If you would like to get REM24 for Caribbean and Mexican leagues you can use my open source project called Beisbol Analitca(https://github.com/andres-alvarado/beisbol-analitica/tree/main/data/run_expectancy/procedimientos) . The code is there in SQL. It generates data at different levels ( i.e [season, majorLeague], [season, majorLeague, stadium] ). Let me know if you need extra help.
What are some alternatives?
pybaseball - Pull current and historical baseball statistics using Python (Statcast, Baseball Reference, FanGraphs)
baseballdatabank - Development for baseball databank, an Open Data collection of historical baseball data
ElectroCRUD - Database CRUD Application Built on Electron | MySQL, Postgres, SQLite
baseballr - A package written for R focused on baseball analysis. Currently in development.
chadwick - Chadwick tools for manipulating baseball data
baseball_sql - SQL scripts for working with the baseball data from retrosheet and baseball-databank, as provided by boxball
condenser - Condenser is a database subsetting tool
MLB-StatsAPI - Python wrapper for MLB Stats API
mlbgame - A Python API to retrieve and read MLB GameDay data
sqlx - 🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.
goqu - SQL builder and query library for golang