Jupyter Notebook Spark

Open-source Jupyter Notebook projects categorized as Spark

Top 5 Jupyter Notebook Spark Projects

  • GitHub repo H2O

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • GitHub repo BigDL

    Building Large-Scale AI Applications for Distributed Big Data

    Project mention: Machine learning on JVM | reddit.com/r/scala | 2021-04-05

    Intel BigDL for Spark which again is for Spark.

  • GitHub repo HELK

    The Hunting ELK

    Project mention: Home lab with security monitoring tools? | reddit.com/r/netsecstudents | 2021-09-23

    HELK can help for the SIEM and detection part

  • GitHub repo JustEnoughScalaForSpark

    A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.

    Project mention: Learning Spark Scala: I'm a medium Python Data Engineer with some experience in Java. I have to learn "enough" Scala to be at ease with Spark's Scala API. I have three weeks. Where should I start ? | reddit.com/r/scala | 2021-02-03

    There's literally something called, "Just enough Scala for Spark." https://github.com/deanwampler/JustEnoughScalaForSpark

  • GitHub repo synapse-azure-data-explorer-101

    Getting started with Azure Synapse and Azure Data Explorer

    Project mention: Getting started with Azure Data Explorer and Azure Synapse Analytics for Big Data processing | dev.to | 2021-07-16

    Notebooks are available in this GitHub repo — https://github.com/abhirockzz/synapse-azure-data-explorer-101

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-09-23.

What are some of the best open-source Spark projects in Jupyter Notebook? This list will help you:

#  Project                          Stars
1  H2O                              5,637
2  BigDL                            3,806
3  HELK                             3,080
4  JustEnoughScalaForSpark            610
5  synapse-azure-data-explorer-101      1