Top 23 Jupyter Notebook Statistic Projects
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)Project mention: Predicting the distribution of a variable rather than a point estimate | reddit.com/r/datascience | 2022-08-14
You’re welcome! I would recommend Bayesian Methods for Hackers
Probabilistic reasoning and statistical analysis in TensorFlowProject mention: [P] Any good resources which can help me with Multivariate Time Series Forecasting using Probabilistic Machine Learning? | reddit.com/r/MachineLearning | 2022-08-14
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.Project mention: [Project] BFLOAT16 on ALL hardware (>= 2009), up to 2000x faster ML algos, 50% less RAM usage for all old/new hardware - Hyperlearn Reborn. | reddit.com/r/MachineLearning | 2022-06-02
Hello everyone!! It's been a while!! Years back I released Hyperlearn https://github.com/danielhanchen/hyperlearn. It has 1.2K Github stars, where I made tonnes of algos faster:
Lightning ⚡️ fast forecasting with statistical and econometric models.Project mention: [Q] Weekly time series forecasting | reddit.com/r/statistics | 2022-08-02
It is available in the StatsForecast package.
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).Project mention: Random Forest Estimation Question | reddit.com/r/datascience | 2022-07-04
Option 2) fit a model from https://github.com/csinva/imodels on the predicted values of the RF
Computations and statistics on manifolds with geometric structures.Project mention: Package for Computations and Statistics on Manifolds | news.ycombinator.com | 2022-01-10
A simple probabilistic programming language.Project mention: Is Edward2 still a part of Tensorflow/Tensorflow Probability or is it discontinued? | reddit.com/r/tensorflow | 2022-06-23
Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
The code repository for projects and tutorials in R and Python that covers a variety of topics in data visualization, statistics sports analytics and general application of probability theory.
Repository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.Project mention: error: the following arguments are required: -i/--input-path, -o/--output-path How to define these? | reddit.com/r/learnpython | 2022-01-14
https://github.com/5agado/data-science-learning/tree/master/graphics/learn_to_paint - github
My notes and codes (jupyter notebooks) for the "The Elements of Statistical Learning" by Trevor Hastie, Robert Tibshirani and Jerome Friedman
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
Porting the R code in ISL to python. Labs and exercisesProject mention: ESL vs ISLR books? | reddit.com/r/datascience | 2021-10-12
Here or here for the Python versions of ISLR.
Data science teaching materialsProject mention: Linear Programming | reddit.com/r/datascience | 2021-11-16
I've created a short course on linear programming - you can find the resources here - https://github.com/ADGEfficiency/teaching-monolith/tree/master/linear-programming
Pretrained GANs + VAEs + classifiers for MNIST/CIFAR in pytorch.Project mention: DCGAN (CIFAR-10) Generating fake images is easy, but how to also output the class label (1 to 10) with the fake generated images? | reddit.com/r/learnmachinelearning | 2022-03-13
I have this DCGAN model (https://github.com/csinva/gan-vae-pretrained-pytorch/tree/master/cifar10_dcgan) which generates fake Cifar-10 images. However I also want to get the intended class label output with the fake generated images. How can I do this? This model which I found only generates fake images but doesn't know what class the generated images belong to.
Wrapper for a PyTorch classifier which allows it to output prediction sets. The sets are theoretically guaranteed to contain the true class with high probability (via conformal prediction).Project mention: [R] Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification - Link to a free online lecture by the author in comments | reddit.com/r/MachineLearning | 2022-03-06
Uncertainty Sets for Image Classifiers using Conformal Prediction https://arxiv.org/abs/2009.14193 https://github.com/aangelopoulos/conformal_classification
Hierarchical 👑 forecasting with statistical and econometric methods.Project mention: Time series forecasting model predicts increasing number for target variable when the actual values are zeroes | reddit.com/r/datascience | 2022-08-01
You can try HierarchicalForecast package to reconciliate predictions.
PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features
Learn then Test: Calibrating Predictive Algorithms to Achieve Risk ControlProject mention: [R] Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification - Link to a free online lecture by the author in comments | reddit.com/r/MachineLearning | 2022-03-06
Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control https://arxiv.org/abs/2110.01052 https://github.com/aangelopoulos/ltt
Can we estimate the economic impact of EIP-1559 on miners? This repository try to estimate the loss of miners' revenue coming from transactions fees, using Ethereum historical data.
Benchmarking programming languages using statistics and machine learning algorithms
A statistical analysis of excess deaths attributable to extreme heat in California's most populous countiesProject mention: End filibuster | reddit.com/r/Political_Revolution | 2022-07-14
Confounded Domain AdapterProject mention: Great thread on the importance of EDA and confounder adjustment prior to differential expression analysis (RNA-seq) | reddit.com/r/bioinformatics | 2022-04-12
Fwiw, last week I released a new method for batch correction that conditions on confounders which are correlated with the batch variable. Preprint here: https://arxiv.org/abs/2203.12720 and Python code here: https://github.com/calvinmccarter/condo-adapter.
Jupyter Notebook Statistics related posts
[Q] Weekly time series forecasting
1 project | reddit.com/r/statistics | 2 Aug 2022
Only lost once (HOMER) but my 99% just turned back into 100% after hitting 200 played. Anyone else?
1 project | reddit.com/r/wordle | 1 Aug 2022
200 up this morning
1 project | reddit.com/r/wordle | 31 Jul 2022
[D] What are some statistical packages you use in R that aren't available in Python?
4 projects | reddit.com/r/statistics | 29 Jul 2022
[P] Fastest and most accurate version of the Exponential Smoothing (ETS) Algorithm for Python
3 projects | reddit.com/r/MachineLearning | 19 Jul 2022
Exponential Smoothing (ETS) for Python
1 project | news.ycombinator.com | 19 Jul 2022
[P] It's settled: AutoArima is a lot(!) faster and more accurate than FB-Prophet. Now you can replace it with just two lines of code without making changes to your pipeline
3 projects | reddit.com/r/MachineLearning | 9 May 2022
What are some of the best open-source Statistic projects in Jupyter Notebook? This list will help you:
Are you hiring? Post a new remote job listing for free.