SaaSHub helps you find the best software and product alternatives Learn more →
Top 3 Jupyter Notebook Big Data Projects
-
H2O
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
View the Project on GitHub
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
NOTE:
The open source projects on this list are ordered by number of github stars.
The number of mentions indicates repo mentiontions in the last 12 Months or
since we started tracking (Dec 2020).
Jupyter Notebook Big Data discussion
Jupyter Notebook Big Data related posts
-
H2O: Your New Best Friend for Scalable Machine Learning
-
Really struggling with open source models
-
Democratizing Large Language Models
-
Interview AI Coach - by email
-
Top 10+ OpenAI Alternatives
-
Stable Attribution
-
Best machine learning framework(s) for production
-
A note from our sponsor - SaaSHub
www.saashub.com | 17 May 2025
Index
What are some of the best open-source Big Data projects in Jupyter Notebook? This list will help you:
# | Project | Stars |
---|---|---|
1 | H2O | 7,158 |
2 | pyspark-tutorial | 1,219 |
3 | csv-schema-inference | 35 |