cicd-templates
nutter
cicd-templates | nutter | |
---|---|---|
1 | 2 | |
167 | 262 | |
- | 2.3% | |
5.7 | 0.0 | |
over 2 years ago | 12 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cicd-templates
-
Databricks Connect and GitHub Actions
That's what Databricks Labs have done in this example I happened to find very shortly after posting this question: https://github.com/databrickslabs/cicd-templates . They run with local pyspark and dbx for launching jobs instead
nutter
-
How much object orienteered do you use in your projects? Bonus points for integration and unit tests
From my experience OO gives you much more flexibility in designing your pipeline but you're risking to make the project way more complicated. The worst example I have seen is the Nutter library (https://github.com/microsoft/nutter), which uses endless classes that are all nested in each other. I once had a bug when using it, and it was a huge pain in the ass to understand what's going on when the code is executed. It is a very good example of what can go wrong when you're overusing OO. However, in one project, I carefully created few classes, just out of curiosity, and I was very impressed how it helped me to organize/structure my code. A functions hase a clear dedicated use, but a good class is like a Swiss army knife with an solid set of functionalities. If you know how to use it in a smart way, you are likely to increase the quality of your code, but the contrary is also very likely, especially when the team members are not ready for it.
-
How do you test your pipelines?
- https://github.com/microsoft/nutter
What are some alternatives?
megalinter - 🦙 MegaLinter analyzes 50 languages, 22 formats, 21 tooling formats, excessive copy-pastes, spelling mistakes and security issues in your repository sources with a GitHub Action, other CI tools or locally.
dbt-databricks - A dbt adapter for Databricks.
megalinter - 🦙 Mega-Linter analyzes 49 languages, 22 formats, 21 tooling formats, excessive copy-pastes, spelling mistakes and security issues in your repository sources with a GitHub Action, other CI tools or locally. [Moved to: https://github.com/oxsecurity/megalinter]
dbx - 🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
azure-devops-python-api - Azure DevOps Python API
databricks-cli - The missing command line client for Databricks SQL
Redash - Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
setup-spark - :octocat:✨ Setup Apache Spark in GitHub Action workflows
terraform-provider-azuredevops - Terraform Azure DevOps provider
azure-cdn-ips - List of Azure CDN IP Addresses
fastdbfs - fastdbfs - An interactive command line client for Databricks DBFS.