Jupyter Notebook Data Science

Open-source Jupyter Notebook projects categorized as Data Science

Top 23 Jupyter Notebook Data Science Projects

Data Science
  1. Made-With-ML

    Learn how to design, develop, deploy and iterate on production-grade ML applications.

  2. Nutrient

    Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.

    Nutrient logo
  3. Data-Science-For-Beginners

    10 Weeks, 20 Lessons, Data Science for All!

    Project mention: Welcome to 14 days of Data Science! | dev.to | 2024-03-07

    Get started with Data Science in the Data Science for Beginners curricula.

  4. Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

    aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

  5. fastbook

    The fastai book, published as Jupyter Notebooks

  6. machine-learning-for-trading

    Code for Machine Learning for Algorithmic Trading, 2nd edition.

  7. python-machine-learning-book

    The "Python Machine Learning (1st edition)" book code repository and info resource

  8. ML-Papers-of-the-Week

    🔥Highlighting the top ML papers every week.

    Project mention: ML Papers of the Week | news.ycombinator.com | 2025-02-11
  9. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  10. numerical-linear-algebra

    Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

  11. amazon-sagemaker-examples

    Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

  12. pycaret

    An open-source, low-code machine learning library in Python

  13. tsfresh

    Automatic extraction of relevant features from time series:

  14. python-training

    Python training for business analysts and traders

    Project mention: JPMorgan's Python training for business analysts and traders | news.ycombinator.com | 2024-08-29
  15. H2O

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  16. evidently

    Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

    Project mention: Evidently: Open-source ML observability platform | news.ycombinator.com | 2024-12-12
  17. machine_learning_complete

    A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

  18. nlpaug

    Data augmentation for NLP

  19. probability

    Probabilistic reasoning and statistical analysis in TensorFlow

  20. MachineLearningNotebooks

    Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft

  21. Data-science

    Collection of useful data science topics along with articles, videos, and code (by khuyentran1401)

  22. FLAML

    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

  23. cracking-the-data-science-interview

    A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

  24. ML-foundations

    Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science

  25. ML-Workspace

    🛠 All-in-one web-based IDE specialized for machine learning and data science.

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook Data Science discussion

Log in or Post with

Jupyter Notebook Data Science related posts

  • ML Papers of the Week

    1 project | news.ycombinator.com | 11 Feb 2025
  • Arsenal FC AI Research Engineer Job Posting

    5 projects | news.ycombinator.com | 25 Jan 2025
  • Statistical Rethinking (2024 Edition)

    3 projects | news.ycombinator.com | 16 Nov 2024
  • PiML: Python Interpretable Machine Learning Toolbox

    2 projects | news.ycombinator.com | 5 Nov 2024
  • Show HN: Create Data Visualization with Data Formulator from Microsoft Research

    4 projects | news.ycombinator.com | 21 Oct 2024
  • Ask HN: Why all these GitHub fake accounts starring my project

    1 project | news.ycombinator.com | 9 May 2024
  • Show HN: Hamilton's UI – observability, lineage, and catalog for data pipelines

    1 project | news.ycombinator.com | 2 May 2024
  • A note from our sponsor - CodeRabbit
    coderabbit.ai | 18 Feb 2025
    Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more →

Index

What are some of the best open-source Data Science projects in Jupyter Notebook? This list will help you:

# Project Stars
1 Made-With-ML 38,163
2 Data-Science-For-Beginners 28,798
3 Probabilistic-Programming-and-Bayesian-Methods-for-Hackers 27,187
4 fastbook 22,578
5 machine-learning-for-trading 14,126
6 python-machine-learning-book 12,340
7 ML-Papers-of-the-Week 10,832
8 numerical-linear-algebra 10,364
9 amazon-sagemaker-examples 10,282
10 pycaret 9,131
11 tsfresh 8,561
12 python-training 7,638
13 H2O 7,035
14 evidently 5,694
15 machine_learning_complete 4,671
16 nlpaug 4,503
17 probability 4,293
18 MachineLearningNotebooks 4,145
19 Data-science 4,077
20 FLAML 4,045
21 cracking-the-data-science-interview 3,879
22 ML-foundations 3,834
23 ML-Workspace 3,471

Sponsored
Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers
Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
www.nutrient.io