Apache Spark on Apple M1

This page summarizes the projects mentioned and recommended in the original post on /r/apachespark.

  • unitTestPySpark

    how to unit test your PySpark code

  • Yes, you can set it up; I did on my `m1`. I would still recommend using a Dockerfile with Spark installed instead: fewer headaches, and easy to use. Here is one that works: https://github.com/danielbeach/unitTestPySpark/blob/main/Dockerfile

  • console

    Discontinued. Open source data infrastructure platform. Designed for developers, built for speed. (by GigahexHQ)

  • It was open sourced last weekend. https://github.com/GigahexHQ/gigahex


Related posts

  • 3 Reasons why we need an Open Source Data Infrastructure Platform

    1 project | dev.to | 7 Mar 2022
  • Introducing Gigahex - An open source data infrastructure platform

    1 project | /r/scala | 22 Feb 2022
  • Data Engineering Zoomcamp Week 6 - using redpanda 1

    1 project | dev.to | 9 Apr 2024
  • Final project part 5

    1 project | dev.to | 3 Apr 2024
  • Building a project in DBT

    1 project | dev.to | 23 Feb 2024