Hadoop Clusters to K8S

This page summarizes the projects mentioned and recommended in the original post on /r/kubernetes.

  • incubator-livy

    Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

  • Spark 3 has some solid support for Kubernetes. Livy is a good addition to Spark on K8S, although its Kubernetes support is not merged upstream yet (there are publicly available images and other material in the PR). I'd strongly recommend against running HDFS on top of Kubernetes. Either cloud-native buckets or on-premise bucket storage like MinIO would be much better suited.
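
To make the recommendation above concrete, here is a minimal sketch of pointing Spark on Kubernetes at bucket storage instead of HDFS. It only assembles configuration and does not contact any cluster. The property names are standard Spark and Hadoop S3A settings, and `file` and `conf` are fields of Livy's `POST /batches` request body. Everything else (host names, image name, bucket, credentials, and the helper function names) is a hypothetical placeholder, not something taken from the original post.

```python
import json


def spark_on_k8s_conf(api_server, image, s3_endpoint, access_key, secret_key):
    """Return Spark properties for a Kubernetes master plus S3A bucket storage."""
    return {
        # Spark addresses a Kubernetes API server with the k8s:// prefix.
        "spark.master": f"k8s://{api_server}",
        "spark.kubernetes.container.image": image,
        # S3A settings; path-style access is what MinIO-like stores usually need.
        "spark.hadoop.fs.s3a.endpoint": s3_endpoint,
        "spark.hadoop.fs.s3a.path.style.access": "true",
        # Placeholders only: in practice, prefer Kubernetes secrets over CLI flags.
        "spark.hadoop.fs.s3a.access.key": access_key,
        "spark.hadoop.fs.s3a.secret.key": secret_key,
    }


def spark_submit_args(conf, app_file):
    """Build a spark-submit argument list for cluster mode."""
    args = ["spark-submit", "--deploy-mode", "cluster"]
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    args.append(app_file)
    return args


def livy_batch_payload(conf, app_file):
    """Build a JSON body for Livy's POST /batches endpoint.

    The master is dropped here on the assumption that it is configured
    on the Livy server rather than per request.
    """
    livy_conf = {k: v for k, v in conf.items() if k != "spark.master"}
    return json.dumps({"file": app_file, "conf": livy_conf}, sort_keys=True)


if __name__ == "__main__":
    conf = spark_on_k8s_conf(
        api_server="https://k8s.example.internal:6443",    # hypothetical
        image="example/spark:3",                           # hypothetical
        s3_endpoint="http://minio.example.internal:9000",  # hypothetical
        access_key="EXAMPLE_KEY",
        secret_key="EXAMPLE_SECRET",
    )
    job = "s3a://example-bucket/jobs/etl.py"  # hypothetical job location
    print(" ".join(spark_submit_args(conf, job)))
    print(livy_batch_payload(conf, job))
```

The same property dictionary feeds both the direct `spark-submit` path and the Livy path, so the choice of storage stays independent of how jobs are submitted.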

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Doing ML works in AWS. Need help installing cartopy

    2 projects | /r/aws | 5 Jun 2023
  • Sparkless is born

    2 projects | /r/apachespark | 24 Nov 2022
  • State of connecting (Jupyter) notebooks to remote Spark 3+ clusters

    2 projects | /r/apachespark | 26 Feb 2022
  • Spark is lit once again

    6 projects | dev.to | 29 Oct 2021