What Are the Most Important Statistical Ideas of the Past 50 Years?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • shap

    A game theoretic approach to explain the output of any machine learning model.

  • Seconding Chris Molnar's excellent writeup. I also find the readme & example notebooks in Scott Lundberg's github repo to be a great way to get started. There are also references there for the original papers, which are surprisingly readable, imo. https://github.com/slundberg/shap

  • interpret

    Fit interpretable models. Explain blackbox machine learning.

  • You may also find Explainable Boosting Machines interesting: https://github.com/interpretml/interpret

    They're a bit like a best of both worlds between linear models and random forests (generalized additive models fit with boosted decision trees)

    Disclosure: I helped build this open source package

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts