SaaSHub helps you find the best software and product alternatives Learn more โ
Top 23 Kaggle Open-Source Projects
-
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
-
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
-
Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
-
catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Project mention: ๐ Why Your ML Service Needs Rust + CatBoost: A Setup Guide That Actually Works | dev.to | 2025-01-19[package] name = "MLApp" version = "0.1.0" edition = "2021" [dependencies] catboost = { git = "https://github.com/catboost/catboost", rev = "0bfdc35"}
-
-
Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials
A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
-
Nutrient
Nutrient - The #1 PDF SDK Library. Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrientโs PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free.
-
-
-
-
upgini
Data search & enrichment library for Machine Learning โ Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
-
-
-
-
deepfake-detection
DeepFake Detection: Detect the video is fake or not using InceptionResNetV2. (by xinyooo)
-
-
-
-
Paper-Recommendation-System
Web interface to search ArXiv papers using NLP Sentence-Transformers, Faiss and Streamlit
-
-
ailert
An open-source platform that aggregates AI content from 230+ sources including research papers, GitHub trends, and industry news, making AI knowledge accessible to everyone.
Project mention: Building an Open-Source AI Newsletter Engine: The Story of AiLert | dev.to | 2025-01-12Code: https://github.com/anuj0456/ailert Docs: https://github.com/anuj0456/ailert/blob/main/README.md
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Kaggle discussion
Kaggle related posts
-
The fastest way to improve quality of ML model on tabular data
-
How are deepfakes different from beauty face filters?
-
[Project] Google ArXiv Papers with NLP semantic-search! Link to Github in the comments!!
-
[P] Collection of Kaggle Past Solutions (to learn ideas and techniques)
-
How to enrich ML models with open data for free: an in-depth review of 5 python libraries
-
Completed all the Kaggle courses.
-
How I complete my email addresses lists with demographic insights with Python
-
A note from our sponsor - SaaSHub
www.saashub.com | 13 Feb 2025
Index
What are some of the best open-source Kaggle projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | data-science-ipython-notebooks | 27,837 |
2 | d2l-en | 24,864 |
3 | LightGBM | 16,937 |
4 | Pytorch-UNet | 9,645 |
5 | catboost | 8,242 |
6 | kaggle-solutions | 5,074 |
7 | Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials | 3,817 |
8 | pytorch-toolbelt | 1,529 |
9 | MLBox | 1,503 |
10 | dfdc_deepfake_challenge | 795 |
11 | upgini | 322 |
12 | benchmarks | 169 |
13 | xgboost_ray | 147 |
14 | crypto | 143 |
15 | deepfake-detection | 101 |
16 | Hello-Kaggle | 80 |
17 | kaggle-courses | 53 |
18 | kaggle-look-alike | 33 |
19 | Paper-Recommendation-System | 20 |
20 | apple-appstore-apps | 19 |
21 | ailert | 13 |
22 | YouTubers-saying-things | 8 |
23 | YouTube-thumbnail-dataset | 4 |