Metric | starthinker | astro
---|---|---
Mentions | 1 | 2
Stars | 166 | 183
Growth | - | -
Activity | 2.8 | 10.0
Last commit | 8 days ago | over 1 year ago
Language | Python | Python
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
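The exact formula behind the activity number is not given here, so the following is only a minimal sketch of how such a score could be computed. It assumes an exponential recency weight with a 30-day half-life and a percentile rank scaled to 0-10. The function names, the half-life, and the ranking rule are all hypothetical and are not the site's actual method.

```python
from datetime import date


def recency_weighted_commits(commit_dates, today, half_life_days=30.0):
    """Sum commit weights; a commit's weight halves every half_life_days.

    The half-life is an assumed parameter, chosen only for illustration.
    """
    total = 0.0
    for commit_date in commit_dates:
        age_days = (today - commit_date).days
        total += 0.5 ** (age_days / half_life_days)
    return total


def activity_score(raw_value, all_raw_values):
    """Scale to 0-10 by the fraction of tracked projects that are strictly less active.

    Under this assumed rule, a score of 9.0 means 90% of tracked projects
    are less active, i.e. the project is in the top 10%.
    """
    less_active = sum(1 for value in all_raw_values if value < raw_value)
    return round(10.0 * less_active / len(all_raw_values), 1)
```

Under these assumptions, a commit made today counts fully while one made 30 days ago counts half, so a project with a few recent commits can outrank one with many old commits.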
starthinker
Connect CM360 - B=Google Big Query
I've used Google's StarThinker to move reports, and it's quite handy, but you need some Google Cloud and general coding experience to get it to work.
astro
After Airflow. Where next for DE?
What I would suggest is that, if you want an "Airflow 3.0" feel, you check out the Astro SDK. My team and I basically spent a year and a half rewriting the Airflow DAG-writing experience from the ground up. It has a completely different feel: highly scalable SQL/Python/Spark (soon) workflows that basically feel like native Python. They are way easier to test as well. You can pass dataframes into SQL queries, load data from any supported source into any supported warehouse, and things like lineage are natively supported :)
What are some alternatives?
astro-sdk - Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Dataplane - Dataplane is a data platform that makes it easy to construct a data mesh with automated data pipelines and workflows.
airflow-maintenance-dags - A series of DAGs/Workflows to help maintain the operation of Airflow
django-q2 - A multiprocessing distributed task queue for Django. Django Q2 is a fork of Django Q. Big thanks to Ilan Steemers for starting this project. Unfortunately, development has stalled since June 2021. Django Q2 is the new updated version of Django Q, with dependencies updates, docs updates and several bug fixes. Original repository: https://github.com/Koed00/django-q
Mage - 🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai
ComputerMonitoring_IOT - Uses Python to pull computer statistics and stores data to BigQuery using Pub/Sub, Cloud Functions.
getting-started - This repository is a getting started guide to Singer.
ad_clicker - Google Ads clicker
sqlelf - Explore ELF objects through the power of SQL
flytekit - Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.
typhoon-orchestrator - Create elegant data pipelines and deploy to AWS Lambda or Airflow