audiophile-e2e-pipeline
finnhub-streaming-data-pipeline
audiophile-e2e-pipeline | finnhub-streaming-data-pipeline | |
---|---|---|
3 | 2 | |
170 | 250 | |
- | - | |
0.0 | 5.6 | |
over 1 year ago | 6 months ago | |
Python | HCL | |
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
audiophile-e2e-pipeline
- Where can I find online projects end-to-end?
-
Celebrating my first Data Engineering Project -- Fitbit data with PySpark, GCP, prefect, and terraform!
ris-tlp adiophile-e2e-pipeline
- Built and automated a complete end-to-end ELT pipeline using AWS, Airflow, dbt, Terraform, Metabase and more as a beginner project!
finnhub-streaming-data-pipeline
-
Reddit Sentiment Analysis Real-Time* Data Pipeline
I didn't use any specific guide. It was mostly build, test, integrate and repeat for each component. For some of them, I went through official documentation on getting started with each application and implemented it in the cluster. However, I reckon you can find other tutorials to setup each application by itself. A few github projects helped me in planning the project architecture and codebase structure like https://github.com/RSKriegs/finnhub-streaming-data-pipeline and https://gitlab.fit.cvut.cz/kozlovit/ni-dip-project-kozlovit.
- Where can I find online projects end-to-end?
What are some alternatives?
data-engineering-zoomcamp - Free Data Engineering course!
Reddit-API-Pipeline
ghcn-d - Data Pipeline from the Global Historical Climatology Network DataSet
streamify - A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
surf_dash
data_engineering_project_1 - My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform
reddit-streaming-pipeline - A real-time reddit data streaming pipeline for sentiment analysis of various subreddits
stream-iot - An end-to-end workflow for processing streaming data on Azure.
StravaDataPipline - :arrows_counterclockwise: :running: EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow