The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 3 Jupyter Notebook Big Data Projects
-
H2O
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I would use H20 if I were you. You can try out LLMs with a nice GUI. Unless you have some familiarity with the tools needed to run these projects, it can be frustrating. https://h2o.ai/
Jupyter Notebook Big Data related posts
- Really struggling with open source models
- Democratizing Large Language Models
- Interview AI Coach - by email
- Top 10+ OpenAI Alternatives
- Stable Attribution
- Best machine learning framework(s) for production
- Software engineers: consider working on genomics
-
A note from our sponsor - WorkOS
workos.com | 26 Apr 2024
Index
What are some of the best open-source Big Data projects in Jupyter Notebook? This list will help you:
Project | Stars | |
---|---|---|
1 | H2O | 6,730 |
2 | csv-schema-inference | 31 |
3 | SDE | 22 |
Sponsored