finnhub-streaming-data-pipeline vs data-engineering-zoomcamp

finnhub-streaming-data-pipeline

Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more (by RSKriegs)

data-engineering-zoomcamp

Free Data Engineering course! (by DataTalksClub)

data-engineering Kafka Spark dbt Docker prefect

Project	finnhub-streaming-data-pipeline	data-engineering-zoomcamp
Mentions	2	119
Stars	250	22,811
Growth	-	3.4%
Activity	5.6	9.4
Latest Commit	6 months ago	25 days ago
Language	HCL	Jupyter Notebook
License	-	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
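
As a rough illustration of the Growth figure described above, month-over-month growth in stars can be read as the percentage change between two monthly star counts. This is a minimal sketch, not the site's actual implementation: the function name `month_over_month_growth` and the sample star counts are hypothetical and chosen only for illustration.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Return the percentage change in stars from one month to the next."""
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100


# Hypothetical star counts, not taken from either project.
growth = month_over_month_growth(previous_stars=1000, current_stars=1034)
print(f"{growth:.1f}%")  # prints 3.4%
```

A project with no earlier star count to compare against has no defined growth under this sketch, which would be one plausible reason for a "-" in the Growth row.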

finnhub-streaming-data-pipeline

Posts with mentions or reviews of finnhub-streaming-data-pipeline. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-26.

Reddit Sentiment Analysis Real-Time* Data Pipeline
3 projects | /r/dataengineering | 26 May 2023

I didn't use any specific guide. It was mostly build, test, integrate and repeat for each component. For some of them, I went through official documentation on getting started with each application and implemented it in the cluster. However, I reckon you can find other tutorials to set up each application by itself. A few GitHub projects helped me in planning the project architecture and codebase structure like https://github.com/RSKriegs/finnhub-streaming-data-pipeline and https://gitlab.fit.cvut.cz/kozlovit/ni-dip-project-kozlovit.

Where can I find online projects end-to-end?
6 projects | /r/dataengineering | 21 Mar 2023

data-engineering-zoomcamp

Posts with mentions or reviews of data-engineering-zoomcamp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-09.

Data Engineering Zoomcamp Week 6 - using redpanda 1
1 project | dev.to | 9 Apr 2024

References: Data engineering zoomcamp week 6 course and homework notes: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/cohorts/2024/06-streaming

Final project part 5
1 project | dev.to | 3 Apr 2024

dbt is the main part of my data engineering project for Data Talks Club's data engineering zoomcamp. After a few frustrating errors on my part, I finally figured out how to make models, where to put the staging models and where to put the core models, how to compile a seed file, and how to join it to the main file in order to produce data for visualization. I also used the git interface to continually upgrade my repository. This was extremely convenient and helpful.

Building a project in DBT
1 project | dev.to | 23 Feb 2024

For Week 4 of DataTalksClub's data engineering zoomcamp, we had to install dbt and create a project. This was a formidable task. dbt is a data transformation tool that enables data analysts and engineers to transform data in a cloud analytics warehouse, BigQuery in our case. It took me a very long time to do this, and in this case I needed the homework extension.

Testing and documenting DBT models
1 project | dev.to | 23 Feb 2024

In this video we learned how to test and document dbt models. We also learned about the codegen library. This is part of Week 4 of the data engineering zoomcamp by DataTalksClub.

Extracting data with dlt
1 project | dev.to | 15 Feb 2024

If you want to run these commands yourself, either in a Jupyter notebook or in Google Colab, you can get the file from HERE. You can get an overview of the workshop HERE. When I ran in a Jupyter notebook, I had to delete the first line (%%capture) and put quotes around dlt[duckdb] in the second line.

Data engineering at home?
1 project | /r/dataengineering | 10 Dec 2023

Take a look. DE zoomcamp

Rockstar Data Engineers making big bucks: what are you doing exactly?
1 project | /r/dataengineering | 9 Dec 2023

If you need guidance you can attend the data engineering zoomcamp, it's free and quite solid.

Self study material
1 project | /r/dataengineering | 17 Aug 2023

Welcome. Start with Data Engineering Zoomcamp, try and build a project, see if you like it, then continue to get into deeper resources.

What is the best way to learn Python if I want to become a data engineer
2 projects | /r/Python | 28 May 2023

Can take a look at this - https://github.com/DataTalksClub/data-engineering-zoomcamp

Course Recommendations for a New Grad
1 project | /r/datascience | 28 May 2023

I think you can start with something free with this pretty practical course on Data Engineering from DataTalksClub - https://github.com/DataTalksClub/data-engineering-zoomcamp

What are some alternatives?

When comparing finnhub-streaming-data-pipeline and data-engineering-zoomcamp you can also consider the following projects:

Reddit-API-Pipeline

mlops-zoomcamp - Free MLOps course from DataTalks.Club

streamify - A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!

Cookbook - The Data Engineering Cookbook

surf_dash

AdventureWorks - Projects using the AdventureWorks database

reddit-streaming-pipeline - A real-time reddit data streaming pipeline for sentiment analysis of various subreddits

versatile-data-kit - One framework to develop, deploy and operate data workflows with Python and SQL.

audiophile-e2e-pipeline - Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.

udacity-capstone

DataEngineerZoomCamp - I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.