Our great sponsors
-
boxball
Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.
-
baseball_sql
SQL scripts for working with the baseball data from retrosheet and baseball-databank, as provided by boxball
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
I've always been good with data, math, Excel, etc., but decided to take it to another level during the pandemic. The most difficult thing about Retrosheet data is getting into a usable format. Luckily, the boxball project has created a number of ready-to-download images with all of the data, for different database technologies.
I started with SQL and then moved to python/pandas. And basically it just took a lot of practice; any time I hear a fun fact or trivia question, I see if I can replicate it myself. Even if I know something is easy to do with Stathead, I've forced myself to code it myself. I moved on to harder and more interesting stuff, like component park factors, aging curves, umpire effects, etc. Some of my code is here.