Jupyter Notebook Machine Learning

Open-source Jupyter Notebook projects categorized as Machine Learning

Top 23 Jupyter Notebook Machine Learning Projects

Machine Learning
  1. llm-course

    Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

    Project mention: Ask HN: People who switched from GPT to their own models. How was it? | news.ycombinator.com | 2024-02-26

    This is a very nice resource: https://github.com/mlabonne/llm-course

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. TensorFlow-Examples

    TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

    Project mention: Top Github repositories for 10+ programming languages | dev.to | 2024-07-16

    Tensorflow examples

  4. Made-With-ML

    Learn how to design, develop, deploy and iterate on production-grade ML applications.

  5. AI-For-Beginners

    12 Weeks, 24 Lessons, AI for All!

    Project mention: The Top 9️⃣ Repositories to learn Python programming + Resources (Extra) 🤯 | dev.to | 2024-11-06

    ⭐️ AI For Beginners on GitHub.

  6. google-research

    Google Research

    Project mention: Phi4 Available on Ollama | news.ycombinator.com | 2025-01-11

    I tested Phi-4 with a Japanese functional test suite and it scored much better than prior Phis (and comparable to much larger models, basically in the top tier atm). [1]

    The one red-flag w/ Phi-4 is that it's IFEval score is relatively low. IFEval has specific types of constraints (forbidden words, capitalization, etc) it tests for [2] but its one area especially worth keeping an eye out for those testing Phi-4 for themselves...

    [1] https://docs.google.com/spreadsheets/u/3/d/18n--cIaVt49kOh-G...

    [2] https://github.com/google-research/google-research/blob/mast...

  7. CLIP

    CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

    Project mention: We used GPT-4o for image detection with 350 similar illustrations | news.ycombinator.com | 2025-01-14

    Yes, you could implement image similarity search using embeddings: Create embeddings for the entire image set, save the embeddings in a database, and add embeddings incrementally as new images come in. To search for a similar image, create the embedding for the image that you are looking for and compute the cosine similarity between that embedding and the embeddings in your database. The closer the cosine similarity is to 1.0 the more similar the images.

    For choosing a model, the article mentions the AWS Titan multimodal model, but you’d have to pay for API access to create the embeddings. Alternatively, self-hosting the CLIP model [0] to create embeddings would avoid API costs.

    Follow-up question: Would the embeddings from the llama3.2-vision models be of higher quality (contain more information) than the original CLIP model?

    The llama vision models use CLIP under the hood, but they add a projection head to align with the text model and the CLIP weights are mutated during alignment training, so I assume the llama vision embeddings would be of higher quality, but I don’t know for sure. Does anybody know?

    (I would love to test this quality myself but Ollama does not yet support creating image embeddings from the llama vision models - a feature request with several upvotes has been opened [1].)

    [0] https://github.com/openai/CLIP

  8. fastai

    The fastai deep learning library

    Project mention: Integrate Hugging Face Spaces & Gradio with a React application | dev.to | 2024-10-12

    We are going to use fast.ai to create a simple classification which predicts whether a given image is a cat or not.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. handson-ml

    ⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

  11. shap

    A game theoretic approach to explain the output of any machine learning model.

    Project mention: IA Explicable: Algoritmos y Métodos para Interpretar Modelos de Caja Negra | dev.to | 2024-06-19
  12. homemade-machine-learning

    🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained

  13. fastbook

    The fastai book, published as Jupyter Notebooks

  14. learnopencv

    Learn OpenCV : C++ and Python Examples

  15. llama-cookbook

    Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

    Project mention: AIM Weekly for 04Nov2024 | dev.to | 2024-11-04

    🌐 Composed Image Retrieval 📎 Intro to Multimodal LLama 3.2 🛠️ Multi Agent Concierge 💻 RAG with Langchain Granite, Milvus 🫶 Download content ✅ Transformer Replacement? 🤖 vLLM for runing models 🌐 Amphion 📝 Autogluon 🚙 Notebook LLama like Google's Notebook LLM 🫶 Monocle2ai for tracing GenAI app code LFA&D Project 🤖 Bee Agent Framework ✅ LLama RFP Response ▶️ GenAI Script 👽 Simular AI Agent S 🦾 DrawDB with AI ✨ Ollama with LLama 3.2 Vision!!!! Preview 🚕 Powerful RAG Checker 📊 SQL Generator 💻 Role of LLMs 🐍 Document Extraction 🕶️ Open Source Vector DB Reddit 🍔 The Practical Guide to Self Hosting LLM 🦾 Stagehand Controller 🕶️ Understanding HNSWLIB 🐍 Best practices in RAG 💻 Enigma Agent 📝 Langchain, Ollama, Phi3 for Function Calling 🔋 Compass Judger 📝 Princeton NLP SimPO 🍔 Princeton NLP ProLong 🔋 Princeton NLP HELMET 🧐 Ollama Cheatsheet 🚕 Princeton NLP CopyCat 📊 Princeton NLP Shp 🕶️ Can LLM Solve Hard Github Issues 📝 Enabling Large Language Models to Generate Text with Citations 🔋 Princeton NLP CharXiv 📊 Awesome AI Agents List 🦾 Nomic’s Matryoshka text embedding model

  16. FinGPT

    FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

    Project mention: About FinGPT: Open-Source Financial Large Language Models | news.ycombinator.com | 2024-08-28
  17. machine-learning-for-trading

    Code for Machine Learning for Algorithmic Trading, 2nd edition.

  18. llm-app

    Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

    Project mention: Show HN: Private RAG with Mistral, Ollama and Pathway | news.ycombinator.com | 2024-07-17

    We’re excited to share our fully local RAG setup, which has received great feedback from the community.

    Full app: https://github.com/pathwaycom/llm-app/tree/main/examples/pip...

    We also feature tools and parsers for various RAG use cases, including document parsing with local vision LMs and some other cool stuff.

  19. python-machine-learning-book

    The "Python Machine Learning (1st edition)" book code repository and info resource

  20. mlops-zoomcamp

    Free MLOps course from DataTalks.Club

    Project mention: Wk 4: Deployment - MLOPs with DataTalks | dev.to | 2024-07-07

    However, let's focus on how to get the assignment done with.

  21. ML-Papers-of-the-Week

    🔥Highlighting the top ML papers every week.

  22. GPU-Puzzles

    Solve puzzles. Learn CUDA.

    Project mention: Ask HN: Resources for General Purpose GPU development on Apple's M* chips? | news.ycombinator.com | 2024-12-25

    If you're looking for a high level introduction to GPU development on Apple silicon I would recommend learning Metal. It's Apple's GPU acceleration language similar to CUDA for Nvidia hardware. I ported a set of puzzles for CUDA called GPU-Puzzles (a collection of exercises designed to teach GPU programming fundamentals)[1] to Metal [2]. I think it's a very accessible introduction to Metal and writing GPU kernels.

    [1] https://github.com/srush/GPU-Puzzles

    [2] https://github.com/abeleinin/Metal-Puzzles

  23. numerical-linear-algebra

    Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

  24. amazon-sagemaker-examples

    Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

  25. TensorFlow-Tutorials

    TensorFlow Tutorials with YouTube Videos

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook Machine Learning discussion

Log in or Post with

Jupyter Notebook Machine Learning related posts

Index

What are some of the best open-source Machine Learning projects in Jupyter Notebook? This list will help you:

# Project Stars
1 llm-course 45,659
2 TensorFlow-Examples 43,493
3 Made-With-ML 38,126
4 AI-For-Beginners 35,908
5 google-research 34,822
6 CLIP 27,248
7 fastai 26,584
8 handson-ml 25,236
9 shap 23,347
10 homemade-machine-learning 23,285
11 fastbook 22,464
12 learnopencv 21,555
13 llama-cookbook 16,099
14 FinGPT 14,797
15 machine-learning-for-trading 14,032
16 llm-app 12,628
17 python-machine-learning-book 12,336
18 mlops-zoomcamp 11,437
19 ML-Papers-of-the-Week 10,768
20 GPU-Puzzles 10,449
21 numerical-linear-algebra 10,347
22 amazon-sagemaker-examples 10,272
23 TensorFlow-Tutorials 9,279

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai