How do you reduce information leakage and bias when going from descriptive analytics to prescriptive analytics?

This page summarizes the projects mentioned and recommended in the original post on reddit.com/r/datascience

  • scikit-learn

    scikit-learn: machine learning in Python

    I'd say the first questions to ask yourself are "Why do I want to do statistical tests?" and "What kind of statistical tests do I want to do?" Most tests rely on a bunch of assumptions, and just winging it will produce a number that gets reported and used but is terribly wrong. Funnily enough, scikit-learn does not directly give you p-values for this very reason and advises you to run the same regression in statsmodels.

