Top 5 Jupyter Notebook Spark Projects
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Building Large-Scale AI Applications for Distributed Big DataProject mention: Machine learning on JVM | reddit.com/r/scala | 2021-04-05
Intel BigDL for Spark which again is for Spark.
Run Linux Software Faster and Safer than Linux with Unikernels.
The Hunting ELKProject mention: Home lab with security monitoring tools? | reddit.com/r/netsecstudents | 2021-09-23
HELK can help for the SIEM and detection part
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.Project mention: Learning Spark Scala: I'm a medium Python Data Engineer with some experience in Java. I have to learn "enough" Scala to be at ease with Spark's Scala API. I have three weeks. Where should I start ? | reddit.com/r/scala | 2021-02-03
There's literally something called, "Just enough Scala for Spark." https://github.com/deanwampler/JustEnoughScalaForSpark
Getting started with Azure Synapse and Azure Data ExplorerProject mention: Getting started with Azure Data Explorer and Azure Synapse Analytics for Big Data processing | dev.to | 2021-07-16
Notebooks are available in this GitHub repo — https://github.com/abhirockzz/synapse-azure-data-explorer-101
Jupyter Notebook Spark related posts
What are some of the best open-source Spark projects in Jupyter Notebook? This list will help you:
Are you hiring? Post a new remote job listing for free.