cicd-templates
dbx
cicd-templates | dbx | |
---|---|---|
1 | 5 | |
167 | 434 | |
- | 2.3% | |
5.7 | 4.6 | |
over 2 years ago | 2 months ago | |
Python | Python | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cicd-templates
-
Databricks Connect and GitHub Actions
That's what Databricks Labs have done in this example I happened to find very shortly after posting this question: https://github.com/databrickslabs/cicd-templates . They run with local pyspark and dbx for launching jobs instead
dbx
-
Snowpark equivalent on Databricks?
Pyspark is the python API for spark. You can write code in a notebook on databricks and run it on a cluster or you can write code in an IDE and run it using dbx through the dbx execute command. If you’re more familiar with Pandas API, you can use Koalas which is a pandas API on Spark
- how/where do you define your databricks jobs, tasks and workflows?
-
Unit & integration testing in Databricks
Hey, Databricks person here. Check out DBX for a template on how to do unit and integration tests: https://github.com/databrickslabs/dbx
-
My top 5 learnings from driving an OSS project
Approximately 1 year ago I've released the first version of dbx - a CLI tool for simple and efficient development and deployment of Databricks jobs.
- Anyone use Pyspark notebook in production ?
What are some alternatives?
megalinter - 🦙 MegaLinter analyzes 50 languages, 22 formats, 21 tooling formats, excessive copy-pastes, spelling mistakes and security issues in your repository sources with a GitHub Action, other CI tools or locally.
databricks-cli - The missing command line client for Databricks SQL
megalinter - 🦙 Mega-Linter analyzes 49 languages, 22 formats, 21 tooling formats, excessive copy-pastes, spelling mistakes and security issues in your repository sources with a GitHub Action, other CI tools or locally. [Moved to: https://github.com/oxsecurity/megalinter]
jupyterlab-integration - DEPRECATED: Integrating Jupyter with Databricks via SSH
nutter - Testing framework for Databricks notebooks
fastdbfs - fastdbfs - An interactive command line client for Databricks DBFS.
setup-spark - :octocat:✨ Setup Apache Spark in GitHub Action workflows
databricks-nutter-projects-demo - Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline [Moved to: https://github.com/alexott/databricks-nutter-repos-demo]
azure-cdn-ips - List of Azure CDN IP Addresses
Redash - Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.