Unit testing your PySpark library

This page summarizes the projects mentioned and recommended in the original post on dev.to

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • example-pyspark-unittesting

    An example of unit testing PySpark code

  • If you want to have a look at the code, or use it in any way (it's MIT licensed), then you can find it over on Github.

  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

  • In software development we often unit test our code (hopefully). And code written for Spark is no different. So here I want to run through an example of building a small library using PySpark and unit testing it. I'm using Visual Studio Code as my editor here, mostly because I think it's brilliant, but other editors are available.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Visual Studio Code

    Visual Studio Code

  • In software development we often unit test our code (hopefully). And code written for Spark is no different. So here I want to run through an example of building a small library using PySpark and unit testing it. I'm using Visual Studio Code as my editor here, mostly because I think it's brilliant, but other editors are available.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Developing a Generic Streamlit UI to Test Amazon Bedrock Agents

    4 projects | dev.to | 5 May 2024
  • How to Develop a User Data Storage Registration Form Using Python.

    3 projects | dev.to | 28 Mar 2024
  • "xAI will open source Grok"

    3 projects | news.ycombinator.com | 11 Mar 2024
  • How to debug Django inside a Docker container with VSCode

    6 projects | dev.to | 26 Jan 2024
  • Apache Spark VS quix-streams - a user suggested alternative

    2 projects | 7 Dec 2023