Top 23 Jupyter Notebook Data Science Projects
Get started with Data Science in the Data Science for Beginners curriculum.
-
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
-
python-machine-learning-book
The "Python Machine Learning (1st edition)" book code repository and info resource
-
numerical-linear-algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
-
amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
-
Project mention: JPMorgan's Python training for business analysts and traders | news.ycombinator.com | 2024-08-29
-
H2O
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
-
evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Project mention: Evidently: Open-source ML observability platform | news.ycombinator.com | 2024-12-12
-
machine_learning_complete
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
-
MachineLearningNotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
-
Data-science
Collection of useful data science topics along with articles, videos, and code (by khuyentran1401)
-
cracking-the-data-science-interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
-
ML-foundations
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
Jupyter Notebook Data Science discussion
Jupyter Notebook Data Science related posts
-
ML Papers of the Week
-
Arsenal FC AI Research Engineer Job Posting
-
Statistical Rethinking (2024 Edition)
-
PiML: Python Interpretable Machine Learning Toolbox
-
Show HN: Create Data Visualization with Data Formulator from Microsoft Research
-
Ask HN: Why all these GitHub fake accounts starring my project
-
Show HN: Hamilton's UI – observability, lineage, and catalog for data pipelines
Index
What are some of the best open-source Data Science projects in Jupyter Notebook? This list will help you:
# | Project | Stars |
---|---|---|
1 | Made-With-ML | 38,163 |
2 | Data-Science-For-Beginners | 28,798 |
3 | Probabilistic-Programming-and-Bayesian-Methods-for-Hackers | 27,187 |
4 | fastbook | 22,578 |
5 | machine-learning-for-trading | 14,126 |
6 | python-machine-learning-book | 12,340 |
7 | ML-Papers-of-the-Week | 10,832 |
8 | numerical-linear-algebra | 10,364 |
9 | amazon-sagemaker-examples | 10,282 |
10 | pycaret | 9,131 |
11 | tsfresh | 8,561 |
12 | python-training | 7,638 |
13 | H2O | 7,035 |
14 | evidently | 5,694 |
15 | machine_learning_complete | 4,671 |
16 | nlpaug | 4,503 |
17 | probability | 4,293 |
18 | MachineLearningNotebooks | 4,145 |
19 | Data-science | 4,077 |
20 | FLAML | 4,045 |
21 | cracking-the-data-science-interview | 3,879 |
22 | ML-foundations | 3,834 |
23 | ML-Workspace | 3,471 |