Manage all types of time series data in a single, purposebuilt database. Run at any scale in any environment in the cloud, onpremises, or at the edge. Learn more →
Top 23 Jupyter Notebook Statistic Projects

ProbabilisticProgrammingandBayesianMethodsforHackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understandingfirst, mathematicssecond point of view. All in pure Python ;)
Also this is quite nice practical introduction which might help with finding answers to your questions: https://github.com/CamDavidsonPilon/ProbabilisticProgrammingandBayesianMethodsforHackers

Project mention: How often do you see Bayesian Statistics or Stan in the DS world? Essential skill or a nice to have?  /r/datascience  20230617
TensorFlowProbability

SonarLint
Clean code begins in your IDE with SonarLint. Up your coding game and discover issues early. SonarLint is a free plugin that helps you find & fix bugs and security issues from the moment you start writing code. Install from your favorite IDE marketplace today.

crackingthedatascienceinterview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Project mention: Can someone recommend some website for data science interview preparation  /r/datascience  20230602 
MLfoundations
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
As others have said, you won't need calculus immediately, but it's important that you make a good attempt at learning up to Calc3. I also didn't have a math heavy undergrad so it took a lot of selfstudy for me, but it's possible. Simulation has a great math boot camp at the beginning to review everything but you'll want to be prepped with Calc before that because that class is all calculus based probability. Some other good resources are the 3Blue1Brown videos on YouTube. They have a great series for both calc & linear algebra to talk through all the intuition with visuals. I also really like John Krohns series because you code through the math which is very applicable for us in this program. I only did his linear Algebra, but he has a whole series with Calc and probability, too. https://github.com/jonkrohn/MLfoundations



imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearncompatible).
Project mention: [D] Have researchers given up on traditional machine learning methods?  /r/MachineLearning  20230131 all domains requiring high interpretability absolutely ignore deep learning at all, and put all their research into traditional ML; see e.g. counterfactual examples, important interpretability methods in finance, or rulebased learning, important in medical or law applications

Mergify
Updating dependencies is timeconsuming.. Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.

fecon235
Notebooks for financial economics. Keywords: Jupyter notebook pandas Federal Reserve FRED Ferbus GDP CPI PCE inflation unemployment wage income debt CaseShiller housing asset portfolio equities SPX bonds TIPS rates currency FX euro EUR USD JPY yen XAU gold Brent WTI oil HoltWinters timeseries forecasting statistics econometrics
Project mention: Financial Economics: Financial Economics Models. Extended Research  star count:915.0  /r/algoprojects  20230610 
Project mention: What is the most performant way to visualize pytorch tensor that stores an image in an interactive manner?  /r/learnmachinelearning  20230129
Maybe this: https://github.com/xl0/lovelytensors



DataScienceProjects
The code repository for projects and tutorials in R and Python that covers a variety of topics in data visualization, statistics sports analytics and general application of probability theory.

BasicMathematicsforMachineLearning
The motive behind Creating this repo is to feel the fear of mathematics and do what ever you want to do in Machine Learning , Deep Learning and other fields of AI

datasciencelearning
Repository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.

theelementsofstatisticallearning
My notes and codes (jupyter notebooks) for the "The Elements of Statistical Learning" by Trevor Hastie, Robert Tibshirani and Jerome Friedman


stanfordCS229
Python solutions to the problem sets of Stanford's graduate course on Machine Learning, taught by Prof. Andrew Ng


covid19severityprediction
Extensive and accessible COVID19 data + forecasting for counties and hospitals. 📈

conformal_classification
Wrapper for a PyTorch classifier which allows it to output prediction sets. The sets are theoretically guaranteed to contain the true class with high probability (via conformal prediction).
Project mention: [P] 🚀 AWS launches Fortuna, an opensource library for Uncertainty Quantification  /r/MachineLearning  20230104What is the best endtoend example showing it? https://github.com/awslabs/fortuna/blob/main/examples/mnist_classification.ipynb ? It would be nice to have some visual explainer, as in https://github.com/aangelopoulos/conformal_classification .

Project mention: Andrew Ng's Machine Learning Specialization or Introduction to Statistical Learning? For someone who's comfortable with mathematics.  /r/learnmachinelearning  20230528
https://github.com/emredjan/ISLpython this GitHub has the exercises in python but I am so pumped the python version is coming out this summer.

Introduction_to_statistical_learning_summary_python
Summary of each chapter of the book Introduction of Statistical Learning (ISL), along with Python code & data.
Project mention: Can you recommend a Python textbook to replace "An Introduction to Statistical Learning with Applications in R", Witten, J. et. al. [E]  /r/statistics  20221212There is Python code available for that book.


InfluxDB
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purposebuilt database. Run at any scale in any environment in the cloud, onpremises, or at the edge.
Jupyter Notebook Statistics related posts
 Apple Maps Gradually Winning over Google Maps Users, Report Suggests
 An Introduction to Statistical Learning with Applications in Python
 Andrew Ng's Machine Learning Specialization or Introduction to Statistical Learning? For someone who's comfortable with mathematics.
 Statistics about how many edits are from users from which year? (e.g. in 2022, a little less than 50% of all edits are from people who had their first edit between 20182022)
 What is the most performant way to visualize pytorch tensor that stores an image in an interactive manner?
 datacamp: NEW Courses  star count:200.0
 datacamp: NEW Courses  star count:200.0

A note from our sponsor  InfluxDB
www.influxdata.com  27 Sep 2023
Index
What are some of the best opensource Statistic projects in Jupyter Notebook? This list will help you:
Project  Stars  

1  ProbabilisticProgrammingandBayesianMethodsforHackers  25,865 
2  probability  3,979 
3  crackingthedatascienceinterview  2,921 
4  MLfoundations  2,418 
5  hyperlearn  1,434 
6  ppd599  1,232 
7  imodels  1,173 
8  fecon235  1,011 
9  lovelytensors  1,002 
10  geomstats  995 
11  edward2  653 
12  DataScienceProjects  558 
13  BasicMathematicsforMachineLearning  535 
14  datasciencelearning  393 
15  theelementsofstatisticallearning  382 
16  monadbayes  379 
17  stanfordCS229  375 
18  datacamp  264 
19  covid19severityprediction  226 
20  conformal_classification  184 
21  ISLpython  178 
22  Introduction_to_statistical_learning_summary_python  157 
23  ganvaepretrainedpytorch  153 