twitter_data-lakehouse_minio_drill_superset
astro
twitter_data-lakehouse_minio_drill_superset | astro | |
---|---|---|
1 | 2 | |
3 | 183 | |
- | - | |
10.0 | 10.0 | |
about 1 year ago | over 1 year ago | |
Python | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
twitter_data-lakehouse_minio_drill_superset
astro
-
After Airflow. Where next for DE?
What I would suggest is if you want an "Airflow 3.0" feel you check out the Astro SDK. My team and I basically spent a year and a half rewriting the Airflow DAG writing experience from the ground up. Completely different feel, highly scalable SQL/python/spark (soon) workflows that basically feel like native python. Way easier to test as well. You can pass dataframes into SQL queries, load data from any supported source to any supported warehouse, and things like lineage are natively supported :)
What are some alternatives?
superset - Apache Superset is a Data Visualization and Data Exploration Platform
astro-sdk - Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Airflow - Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
airflow-maintenance-dags - A series of DAGs/Workflows to help maintain the operation of Airflow
Mage - 🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai
getting-started - This repository is a getting started guide to Singer.
sqlelf - Explore ELF objects through the power of SQL
typhoon-orchestrator - Create elegant data pipelines and deploy to AWS Lambda or Airflow
f1-data-pipeline - F1 Data Pipeline
mais - ⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.github.io/mais/
pathy - simple, flexible, offline capable, cloud storage with a Python path-like interface
grafana-backup-tool - A Python-based application to backup Grafana settings by using the Grafana API