| | nutter | dbx |
|---|---|---|
| Mentions | 2 | 5 |
| Stars | 262 | 435 |
| Stars growth (monthly) | 2.3% | 2.5% |
| Activity | 0.0 | 4.6 |
| Last commit | 24 days ago | 3 months ago |
| Language | Python | Python |
| License | MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
nutter
- How much object orientation do you use in your projects? Bonus points for integration and unit tests
In my experience, OO gives you much more flexibility in designing your pipeline, but you risk making the project far more complicated. The worst example I have seen is the Nutter library (https://github.com/microsoft/nutter), which uses endless classes that are all nested in each other. I once hit a bug when using it, and it was a huge pain in the ass to understand what was going on as the code executed. It is a very good example of what can go wrong when you overuse OO. However, in one project I carefully created a few classes, just out of curiosity, and I was very impressed by how they helped me organize and structure my code. A function has a clear, dedicated use, but a good class is like a Swiss army knife with a solid set of functionalities. If you know how to use it in a smart way, you are likely to increase the quality of your code, but the contrary is also very likely, especially when the team members are not ready for it.
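The "few, carefully scoped classes" idea above can be sketched as follows. All names here are illustrative, not taken from any of the projects on this page: a small `Pipeline` class whose only job is to sequence steps, while the transformations themselves stay plain functions.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class PipelineStep:
    """A named transformation; the logic itself is just a function."""
    name: str
    transform: Callable[[list], list]

class Pipeline:
    """A small, single-purpose class: it only sequences steps."""
    def __init__(self, steps: List[PipelineStep]):
        self.steps = steps

    def run(self, data: list) -> list:
        for step in self.steps:
            data = step.transform(data)
        return data

pipeline = Pipeline([
    PipelineStep("drop_negatives", lambda xs: [x for x in xs if x >= 0]),
    PipelineStep("double", lambda xs: [x * 2 for x in xs]),
])
result = pipeline.run([-1, 2, 3])  # [4, 6]
```

Because the transforms are plain functions, each one can still be unit-tested in isolation; the class only adds structure, not behavior.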
- How do you test your pipelines?
- https://github.com/microsoft/nutter
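Nutter organizes notebook tests as fixture classes whose test cases are `run_<name>`/`assertion_<name>` method pairs. Below is a self-contained sketch of that naming pattern only, not nutter's actual implementation (the real library executes the `run_` step against a Databricks notebook):

```python
# Minimal sketch of the nutter-style fixture pattern: each test case is a
# run_<name> method (arrange/act) paired with an assertion_<name> method.
class SimpleFixture:
    def execute_tests(self):
        results = {}
        for name in dir(self):
            if name.startswith("assertion_"):
                case = name[len("assertion_"):]
                run = getattr(self, "run_" + case, None)
                if run is not None:
                    run()
                try:
                    getattr(self, name)()
                    results[case] = "passed"
                except AssertionError:
                    results[case] = "failed"
        return results

class MyNotebookTests(SimpleFixture):
    def run_row_count(self):
        self.rows = [1, 2, 3]  # stand-in for running a notebook cell
    def assertion_row_count(self):
        assert len(self.rows) == 3

print(MyNotebookTests().execute_tests())  # {'row_count': 'passed'}
```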
dbx
- Snowpark equivalent on Databricks?
PySpark is the Python API for Spark. You can write code in a notebook on Databricks and run it on a cluster, or you can write code in an IDE and run it using dbx via the `dbx execute` command. If you're more familiar with the pandas API, you can use Koalas, which is a pandas API on Spark.
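For reference, a `dbx execute` invocation might look like the following. The workflow and cluster names are placeholders, and the exact flags vary between dbx versions; this follows the 0.7+ style:

```shell
# Run a workflow's code from a local IDE on an existing all-purpose cluster.
# "my-etl-workflow" must be defined in your dbx deployment file;
# "dev-cluster" is the name of a running interactive cluster.
dbx execute my-etl-workflow --cluster-name "dev-cluster"
```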
- How/where do you define your Databricks jobs, tasks, and workflows?
- Unit & integration testing in Databricks
Hey, Databricks person here. Check out DBX for a template on how to do unit and integration tests: https://github.com/databrickslabs/dbx
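One idea behind such a template is keeping business logic in plain functions so it can be unit-tested locally without a cluster. A hedged sketch of that separation; the function, data, and test names are illustrative, not taken from the dbx template:

```python
# Business logic kept as a plain function so it can be unit-tested
# locally; the job code would apply the same logic to Spark data.
def keep_active_users(rows):
    """Filter out rows whose 'status' is not 'active'."""
    return [r for r in rows if r.get("status") == "active"]

def test_keep_active_users():
    rows = [{"id": 1, "status": "active"}, {"id": 2, "status": "inactive"}]
    assert keep_active_users(rows) == [{"id": 1, "status": "active"}]

test_keep_active_users()
```

The integration test would then run the deployed job end to end on a workspace, while this unit test runs in plain pytest on any machine.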
- My top 5 learnings from driving an OSS project
About a year ago I released the first version of dbx, a CLI tool for simple and efficient development and deployment of Databricks jobs.
- Anyone use PySpark notebooks in production?
What are some alternatives?
- cicd-templates - Manage your Databricks deployments and CI with code.
- databricks-cli - The missing command line client for Databricks SQL
- dbt-databricks - A dbt adapter for Databricks.
- azure-devops-python-api - Azure DevOps Python API
- jupyterlab-integration - DEPRECATED: Integrating Jupyter with Databricks via SSH
- Redash - Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
- fastdbfs - An interactive command line client for Databricks DBFS.
- terraform-provider-azuredevops - Terraform Azure DevOps provider
- databricks-nutter-projects-demo - Demo of using Nutter for testing Databricks notebooks in a CI/CD pipeline [Moved to: https://github.com/alexott/databricks-nutter-repos-demo]