Python Kaggle

Open-source Python projects categorized as Kaggle

Top 11 Python Kaggle Projects

  1. data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  2. d2l-en

    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

  3. Pytorch-UNet

    PyTorch implementation of the U-Net for image semantic segmentation with high-quality images

  4. Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials

    A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI / Deep Learning / Machine Vision / NLP and industry-specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

  5. pytorch-toolbelt

    PyTorch extensions for fast R&D prototyping and Kaggle farming

  6. MLBox

    MLBox is a powerful Automated Machine Learning Python library.

  7. dfdc_deepfake_challenge

    A prize-winning solution for the DFDC challenge

  8. upgini

    Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs

  9. xgboost_ray

    Distributed XGBoost on Ray

  10. Paper-Recommendation-System

    Web interface to search ArXiv papers using NLP Sentence-Transformers, Faiss and Streamlit

  11. ailert

    An open-source platform that aggregates AI content from 230+ sources including research papers, GitHub trends, and industry news, making AI knowledge accessible to everyone.

    Project mention: Building an Open-Source AI Newsletter Engine: The Story of AiLert | dev.to | 2025-01-12

    Code: https://github.com/anuj0456/ailert
    Docs: https://github.com/anuj0456/ailert/blob/main/README.md

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Kaggle related posts

  • The fastest way to improve quality of ML model on tabular data

    1 project | /r/learnmachinelearning | 18 Jun 2023
  • How are deepfakes different from beauty face filters?

    1 project | /r/computervision | 27 May 2023
  • [Project] Google ArXiv Papers with NLP semantic-search! Link to Github in the comments!!

    1 project | /r/MachineLearning | 19 Feb 2023
  • How to enrich ML models with open data for free: an in-depth review of 5 python libraries

    1 project | /r/Python | 2 Sep 2022
  • How I complete my email addresses lists with demographic insights with Python

    1 project | /r/Python | 27 Jul 2022
  • [OC] Divorced relationship status share of users at Facebook

    1 project | /r/dataisbeautiful | 9 Jul 2022
  • GitHub - searching open and public data through autoML. Please give a Star on GitHub

    1 project | /r/programming | 1 Jul 2022

Index

What are some of the best open-source Kaggle projects in Python? This list will help you:

 #  Project                                                           Stars
 1  data-science-ipython-notebooks                                    27,993
 2  d2l-en                                                            25,832
 3  Pytorch-UNet                                                      10,098
 4  Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials  3,866
 5  pytorch-toolbelt                                                  1,541
 6  MLBox                                                             1,514
 7  dfdc_deepfake_challenge                                           812
 8  upgini                                                            331
 9  xgboost_ray                                                       148
10  Paper-Recommendation-System                                       21
11  ailert                                                            21
