fastdbfs
fastdbfs - An interactive command line client for Databricks DBFS. (by salva)
nutter
Testing framework for Databricks notebooks (by microsoft)
Our great sponsors
fastdbfs | nutter | |
---|---|---|
1 | 2 | |
4 | 261 | |
- | 3.1% | |
0.0 | 0.0 | |
almost 3 years ago | 8 days ago | |
Python | Python | |
GNU General Public License v3.0 only | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
fastdbfs
Posts with mentions or reviews of fastdbfs.
We have used some of these posts to build our list of alternatives
and similar projects.
-
fastdbfs - An interactive command line client for Databricks DBFS
fastdbfs is an interactive command line client for accessing Databricks DBFS. It aims to be much more friendly and faster than the official CLI tool and also feature rich.
nutter
Posts with mentions or reviews of nutter.
We have used some of these posts to build our list of alternatives
and similar projects.
-
How much object orienteered do you use in your projects? Bonus points for integration and unit tests
From my experience OO gives you much more flexibility in designing your pipeline but you're risking to make the project way more complicated. The worst example I have seen is the Nutter library (https://github.com/microsoft/nutter), which uses endless classes that are all nested in each other. I once had a bug when using it, and it was a huge pain in the ass to understand what's going on when the code is executed. It is a very good example of what can go wrong when you're overusing OO. However, in one project, I carefully created few classes, just out of curiosity, and I was very impressed how it helped me to organize/structure my code. A functions hase a clear dedicated use, but a good class is like a Swiss army knife with an solid set of functionalities. If you know how to use it in a smart way, you are likely to increase the quality of your code, but the contrary is also very likely, especially when the team members are not ready for it.
-
How do you test your pipelines?
- https://github.com/microsoft/nutter
What are some alternatives?
When comparing fastdbfs and nutter you can also consider the following projects:
dbx - 🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
cicd-templates - Manage your Databricks deployments and CI with code.
koalas - Koalas: pandas API on Apache Spark
dbt-databricks - A dbt adapter for Databricks.
Redash - Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
azure-devops-python-api - Azure DevOps Python API
terraform-provider-azuredevops - Terraform Azure DevOps provider
databricks-cli - The missing command line client for Databricks SQL