spark-fast-tests vs databricks-nutter-projects-demo
| | spark-fast-tests | databricks-nutter-projects-demo |
|---|---|---|
| Mentions | 6 | 1 |
| Stars | 418 | 48 |
| Growth | - | - |
| Activity | 0.0 | 6.9 |
| Latest commit | 3 months ago | about 2 years ago |
| Language | Scala | Python |
| License | MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spark-fast-tests
-
Lakehouse architecture in Azure Synapse without Databricks?
I was a Databricks user for 5 years and spent 95% of my time developing Spark code in IDEs. See the spark-daria and spark-fast-tests projects as Scala examples. I developed internal libraries with all the business logic. The Databricks notebooks would consist of a few lines of code that would invoke a function in the proprietary Spark codebase. The proprietary Spark codebase would depend on the OSS libraries I developed in parallel.
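That split is easy to sketch. Below is a minimal, hypothetical example of the pattern: all business logic lives in a compiled library, and the notebook only calls into it. The package, object, and column names here are made up for illustration.

```scala
// Library code, compiled into a JAR and attached to the cluster.
// Package, object, and column names are hypothetical.
package com.example.etl

import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.col

object OrderTransforms {
  // Business logic lives here, where it can be unit tested from an IDE.
  def withTotalPrice(df: DataFrame): DataFrame =
    df.withColumn("total_price", col("quantity") * col("unit_price"))
}
```

The Databricks notebook then shrinks to a single line such as `display(OrderTransforms.withTotalPrice(ordersDF))`.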
-
Well-designed Scala/Spark project
https://github.com/MrPowers/spark-fast-tests
https://github.com/97arushisharma/Scala_Practice/tree/master/BigData_Analysis_with_Scala_and_Spark/wikipedia
-
Unit & integration testing in Databricks
If the majority of your stuff is not UDF-based, there is an open-source solution for running assertion tests against full data frames called spark-fast-tests. The idea is similar: you have a test notebook that calls your actual notebook against a staged input, reads the output, and compares it to a prefabbed expected output. This does take a bit of setup and trial and error, but it's the closest I've been able to get to proper automated regression testing in Databricks.
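For context, here is roughly what a spark-fast-tests assertion looks like. This is a ScalaTest sketch; in the workflow described above, `actualDF` would come from the notebook or transformation under test rather than a literal.

```scala
import org.apache.spark.sql.SparkSession
import org.scalatest.funspec.AnyFunSpec
import com.github.mrpowers.spark.fast.tests.DataFrameComparer

class NotebookLogicSpec extends AnyFunSpec with DataFrameComparer {

  lazy val spark: SparkSession =
    SparkSession.builder().master("local[*]").appName("tests").getOrCreate()

  it("matches the prefabbed expected output") {
    import spark.implicits._

    // In a real regression test, actualDF is the output of the code
    // under test, run against a staged input.
    val actualDF   = Seq(("jose", 1), ("li", 2)).toDF("name", "id")
    val expectedDF = Seq(("jose", 1), ("li", 2)).toDF("name", "id")

    // Fails with a readable row/schema diff when the frames differ.
    assertSmallDataFrameEquality(actualDF, expectedDF)
  }
}
```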
-
Show dataengineering: beavis, a library for unit testing Pandas/Dask code
I am the author of spark-fast-tests and chispa, libraries for unit testing Scala Spark / PySpark code.
-
Ask HN: What are some tools / libraries you built yourself?
I built daria (https://github.com/MrPowers/spark-daria) to make it easier to write Spark code and spark-fast-tests (https://github.com/MrPowers/spark-fast-tests) to provide a good testing workflow (a short daria sketch follows this post).
quinn (https://github.com/MrPowers/quinn) and chispa (https://github.com/MrPowers/chispa) are the PySpark equivalents.
Built bebe (https://github.com/MrPowers/bebe) to expose the Spark Catalyst expressions that aren't exposed to the Scala / Python APIs.
Also built spark-sbt.g8 to create a Spark project with a single command: https://github.com/MrPowers/spark-sbt.g8
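As a taste of the daria helpers mentioned above, here is a short sketch of its DataFrame validator; the call shown is from the project README, but double-check the current API before relying on it.

```scala
import org.apache.spark.sql.SparkSession
import com.github.mrpowers.spark.daria.sql.DariaValidator

object ValidatorSketch extends App {
  val spark = SparkSession.builder().master("local[*]").getOrCreate()
  import spark.implicits._

  val df = Seq(("maria", 27)).toDF("first_name", "age")

  // Throws a descriptive exception if any required column is missing,
  // turning silent schema drift into a fast, readable failure.
  DariaValidator.validatePresenceOfColumns(df, Seq("first_name", "age"))
}
```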
-
Open source contributions for a Data Engineer?
I've built popular PySpark (quinn, chispa) and Scala Spark (spark-daria, spark-fast-tests) libraries.
databricks-nutter-projects-demo
-
Unit & integration testing in Databricks
You can also use the approach described here: https://github.com/alexott/databricks-nutter-projects-demo
What are some alternatives?
Prefect - The easiest way to build, run, and monitor data pipelines at scale.
dbx - 🧱 Databricks CLI eXtensions - aka dbx, a CLI tool for development and advanced Databricks workflow management.
chispa - PySpark test helper methods with beautiful error messages
soda-sql - Data profiling, testing, and monitoring for SQL-accessible data.
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
sqlfluff - A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
spark-daria - Essential Spark extensions and helper methods ✨😲
dagster - An orchestration platform for the development, production, and observation of data assets.
beavis - Pandas helper functions
snipp.in - Fast, lightweight notes and snippet manager and code editor that runs directly in your browser
Metabase - The simplest, fastest way to get business intelligence and analytics to everyone in your company