dbx
Redash
dbx | Redash | |
---|---|---|
5 | 38 | |
434 | 24,994 | |
2.3% | 0.8% | |
4.6 | 9.5 | |
2 months ago | 4 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | BSD 2-clause "Simplified" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dbx
-
Snowpark equivalent on Databricks?
Pyspark is the python API for spark. You can write code in a notebook on databricks and run it on a cluster or you can write code in an IDE and run it using dbx through the dbx execute command. If you’re more familiar with Pandas API, you can use Koalas which is a pandas API on Spark
- how/where do you define your databricks jobs, tasks and workflows?
-
Unit & integration testing in Databricks
Hey, Databricks person here. Check out DBX for a template on how to do unit and integration tests: https://github.com/databrickslabs/dbx
-
My top 5 learnings from driving an OSS project
Approximately 1 year ago I've released the first version of dbx - a CLI tool for simple and efficient development and deployment of Databricks jobs.
- Anyone use Pyspark notebook in production ?
Redash
- Redash: Connect to data source, easily visualize, dashboard and share your data
- FLaNK Stack 26 February 2024
- Contribuir con proyectos Open Source
-
Auto reloading Odoo with Docker
It seems like there may be an issue with Watchdog on Apple Silicon.
-
Tool or service for querying and exposing database through API
I am looking for service or tool similiar to Metabase or Redash that allows me to add data source - for example Postgres connection, and create raw SQL queries that can be shared or exposed through API. So instead of keeping raw SQL code somewhere, my other service would call this tool e.g. http://microservice/query=1?param1=xx&page=2 and get the results from the DB. These calls are internal only and part of ETL processes, but of course authentication would be required.
-
A PostgreSQL Docker container that automatically upgrades PostgreSQL
Yeah, a lot of the time I'd agree with you.
This container came about for the Redash project (https://github.com/getredash/redash), which has been stuck on PostgreSQL 9.5 (!) for years.
Moving to a new PostgreSQL container version is easy enough for new installations, but rolling that kind of change out to an existing userbase isn't so pretty.
For people familiar with the command line, PostgreSQL, and Docker then no worries.
But a large number of Redash deployments seem to have been done by people not skilled in those things. "We deployed it from the Digital Ocean droplet / AWS image / etc!"
For those situations, something that takes care of the database upgrade process automatically is the better approach. :)
-
Did anyone try Openblocks for multi-tenant client reporting?
I have tried Metabase, Redash beore (both self hosted open source versions), from my experience I find Metabase a bit easy to work with.
-
Best apps for transitioning from Spreadsheets to SQLite?
Regarding visualization tools, sqliteviz has proven to be the best I've found so far. Their web app runs locally but has some trackers, so I run it locally via a simple, static HTTP server. Falcon and Redash seem like overkill for my needs.
-
Chartbrew – create live reporting dashboards from APIs, MongoDB, Firestore, etc.
Redash seems to be dead or at least in hibernation. There hasn't been a release in over a year.
https://github.com/getredash/redash/issues/5891
-
Real Time Data Infra Stack
redash
What are some alternatives?
databricks-cli - The missing command line client for Databricks SQL
Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]
cicd-templates - Manage your Databricks deployments and CI with code.
Metabase - The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
jupyterlab-integration - DEPRECATED: Integrating Jupyter with Databricks via SSH
plotly - The interactive graphing library for Python :sparkles: This project now includes Plotly Express!
nutter - Testing framework for Databricks notebooks
cube.js - 📊 Cube — The Semantic Layer for Building Data Applications
fastdbfs - fastdbfs - An interactive command line client for Databricks DBFS.
bokeh - Interactive Data Visualization in the browser, from Python
databricks-nutter-projects-demo - Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline [Moved to: https://github.com/alexott/databricks-nutter-repos-demo]
Druid - Apache Druid: a high performance real-time analytics database.