Spark is lit once again

This page summarizes the projects mentioned and recommended in the original post on dev.to

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • incubator-livy

    Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

    Apache Livy - an open-source REST API for interacting with Apache Spark from anywhere.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • incubator-livy

    Mirror of Apache livy (Incubating) (by jahstreet)

    Latter was a go-to solution at the time when we were only using Spark on YARN. Sadly Apache Livy is not maintained anymore: it has no K8s support, Spark client is more and more outdated with every passing day. For some time we used @jahstreet's fork which had K8s available. But then we saw that the Livy project hadn't received any updates and we decided to implement our own solution - Exacaster Lighter.

  • lighter

    REST API for Apache Spark on K8S or YARN

    Latter was a go-to solution at the time when we were only using Spark on YARN. Sadly Apache Livy is not maintained anymore: it has no K8s support, Spark client is more and more outdated with every passing day. For some time we used @jahstreet's fork which had K8s available. But then we saw that the Livy project hadn't received any updates and we decided to implement our own solution - Exacaster Lighter.

  • sparkmagic

    Jupyter magics and kernels for working with remote Spark clusters

    Things get a bit more complicated on interactive sessions. We've created Sparkmagic compatible REST API so that Sparkmagic kernel could communicate with Lighter the same way as it does with Apache Livy. When a user creates an interactive session Lighter server submits a custom PySpark application which contains an infinite loop which constantly checks for new commands to be executed. Each Sparkmagic command is saved on Java collection, retrieved by the PySpark application through Py4J Gateway and executed.

  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Here at Exacaster Spark applications have been used extensively for years. We started using them on our Hadoop clusters with YARN as an application manager. However, with our recent product, we started moving towards a Cloud-based solution and decided to use Kubernetes for our infrastructure needs.

  • Apache Hadoop

    Apache Hadoop

    Here at Exacaster Spark applications have been used extensively for years. We started using them on our Hadoop clusters with YARN as an application manager. However, with our recent product, we started moving towards a Cloud-based solution and decided to use Kubernetes for our infrastructure needs.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Unveiling the Analytics Industry in Bangalore

    3 projects | /r/u_Khushisondhi7 | 23 Mar 2023
  • 5 Best Practices For Data Integration To Boost ROI And Efficiency

    3 projects | /r/ReviewNPrep | 12 Mar 2023
  • Big Data Processing, EMR with Spark and Hadoop | Python, PySpark

    2 projects | dev.to | 27 Mar 2022
  • Spark for beginners - and you

    3 projects | dev.to | 22 Dec 2021
  • Hadoop Installation and Deployment Guide

    1 project | dev.to | 21 Aug 2024

Did you konow that Scala is
the 33rd most popular programming language
based on number of metions?