streamify
audiophile-e2e-pipeline
| | streamify | audiophile-e2e-pipeline |
|---|---|---|
| Mentions | 4 | 3 |
| Stars | 474 | 170 |
| Growth | - | - |
| Activity | 0.0 | 0.0 |
| Last commit | about 2 years ago | over 1 year ago |
| Language | Python | Python |
| | - | - |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.
streamify
- Where can I find online projects end-to-end?
- Completed my first Data Engineering project with Kafka, Spark, GCP, Airflow, dbt, Terraform, Docker and more!
audiophile-e2e-pipeline
- Where can I find online projects end-to-end?
- Celebrating my first Data Engineering Project -- Fitbit data with PySpark, GCP, prefect, and terraform!
ris-tlp audiophile-e2e-pipeline
- Built and automated a complete end-to-end ELT pipeline using AWS, Airflow, dbt, Terraform, Metabase and more as a beginner project!
What are some alternatives?
eventsim - Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
data-engineering-zoomcamp - Free Data Engineering course!
terraform - Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared amongst team members, treated as code, edited, reviewed, and versioned.
ghcn-d - Data Pipeline from the Global Historical Climatology Network DataSet
Airflow - Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Reddit-API-Pipeline
data_engineering_project_1 - My first attempt at a rough ETL pipeline; technologies include Spark, GCS, Prefect orchestration, and Terraform
finnhub-streaming-data-pipeline - Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more
stream-iot - An end-to-end workflow for processing streaming data on Azure.
tfl-bikes-data-pipeline - Processing TFL data for bike usage with Google Cloud Platform.
StravaDataPipline - EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow