f1-data-pipeline
weather_data_pipeline
Metric | f1-data-pipeline | weather_data_pipeline
---|---|---
Mentions | 1 | 1
Stars | 23 | 3
Growth | - | -
Activity | 6.8 | 4.2
Latest commit | 10 months ago | about 1 year ago
Language | Python | Python
License | - | -
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
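The exact activity formula is not given here, so the following is only a minimal sketch of the two ideas described above: weighting recent commits more heavily, and expressing the result relative to the other tracked projects. The function names, the exponential decay, and the 90-day half-life are all assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def recency_weighted_activity(commit_dates, today, half_life_days=90.0):
    """Sum per-commit weights that halve every `half_life_days` (assumed decay)."""
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def relative_score(raw, all_raw):
    """Map a raw value to a 0-10 scale by the share of tracked projects strictly below it.

    Under this assumed mapping, a score of 9.0 means 90% of projects are
    less active, i.e. the project sits in the top 10%.
    """
    below = sum(1 for r in all_raw if r < raw)
    return round(10.0 * below / len(all_raw), 1)
```

With ten tracked projects whose raw values are all different, the most active one has nine projects below it and scores 9.0 under this mapping, which matches the "top 10%" reading of the example above.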
weather_data_pipeline
Building a Weather Data Pipeline with PySpark, Prefect, and Google Cloud
We'll be using PySpark for distributed data processing, Prefect for workflow management, and Google Cloud Storage and BigQuery for data storage and processing. The code is available on GitHub.
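The stack named in that post maps onto a familiar extract, transform, load shape. The sketch below shows only that shape, using the Python standard library so it runs anywhere. It is not the post's code: the function names and sample records are hypothetical, the transform stands in for what would be a PySpark job, the load stands in for writes to Google Cloud Storage and BigQuery, and the top-level function is what an orchestrator such as Prefect would schedule.

```python
from statistics import mean


def extract():
    """Stand-in for fetching raw weather readings (hypothetical sample records)."""
    return [
        {"city": "Oslo", "temp_c": 4.0},
        {"city": "Oslo", "temp_c": 6.0},
        {"city": "Lima", "temp_c": 19.0},
        {"city": "Lima", "temp_c": None},  # missing reading, dropped in transform
    ]


def transform(rows):
    """Drop missing readings and average the temperature per city."""
    by_city = {}
    for row in rows:
        if row["temp_c"] is not None:
            by_city.setdefault(row["city"], []).append(row["temp_c"])
    return {city: mean(temps) for city, temps in sorted(by_city.items())}


def load(summary, sink):
    """Stand-in for a storage write: copy the summary into `sink`, return the row count."""
    sink.update(summary)
    return len(summary)


def weather_pipeline(sink):
    """Run the three stages in order."""
    return load(transform(extract()), sink)
```

Calling `weather_pipeline({})` with an empty dict as the sink fills it with one averaged temperature per city. Keeping each stage a plain function makes it straightforward to test the steps in isolation before wiring them into a scheduler.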
What are some alternatives?
dbt2looker - Generate LookML for views from dbt models
magic-the-gathering - A complete pipeline to pull data from Scryfall's "Magic: The Gathering"-API, via Prefect orchestration and dbt transformation.
astro - Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. [Moved to: https://github.com/astronomer/astro-sdk]
prefect-deployment-patterns - Code examples showing flow deployment to various types of infrastructure
steam-data-engineering - A data engineering project with Airflow, dbt, Terraform, GCP and much more!
youtube_data_analysis - Created an optimised pipeline to provide accurate data for analysis, then used Snowsight (provided by Snowflake) to create a dashboard.
dataproc-templates - Dataproc templates and pipelines for solving simple in-cloud data tasks
maternal-health-risk - Maternal Health Risk prediction MLOps pipeline
dbt-coves - CLI tool for dbt users to simplify creation of staging models (yml and sql) files
Prefect - The easiest way to build, run, and monitor data pipelines at scale.