PySpark style guide

This page summarizes the projects mentioned and recommended in the original post on /r/apachespark.

  • spark-style-guide

    Spark style guide

  • I created a PySpark style guide to help the community write code that's easy to reuse, unit test, and debug. Feel free to open issues / PRs if you have any suggestions / improvements.

  • pyspark-style-guide

    This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.

  • For completeness, here is the Palantir PySpark style guide, which offers different guidance that you might also find interesting.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.

