Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
DataEngineeringProject Alternatives
Similar projects and alternatives to DataEngineeringProject
-
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
-
AdvancedSQLPuzzles
Welcome to my GitHub repository. I hope you enjoy solving these puzzles as much as I have enjoyed creating them.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
blinkist-scraper
📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
amazon-s3-find-and-forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
-
openwisp-monitoring
Network monitoring system written in Python and Django, designed to be extensible, programmable, scalable and easy to use by end users: once the system is configured, monitoring checks, alerts and metric collection happens automatically.
-
openverse-catalog
Discontinued Identifies and collects data on cc-licensed content across web crawl data and public apis.
-
SF-EvictionTracker
Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.
DataEngineeringProject reviews and mentions
- What are your favourite GitHub repos that shows how data engineering should be done?
- Is it me or are beginner-friendly ETL pipeline guides that explain from the ground-up how to incorporate the use of various technologies notoriously difficult to find.
-
Starting A Data Engineering Project Series
News RSS Feeds
-
5 Data Sources for Data Engineering Projects
Lastly, the most readily available data source would be data scraped from the internet. To be slightly less vague, I have outlined a project that web-scrapes new online articles every ten minutes to provide all the latest news curated into one place. This project utilizes a wide variety of relevant data engineering tools, which makes it a great project example. The author of this project is Damian KliÅ›, and he outlines his model architecture below:
-
Can You Recommend Good Data Engineering Projects
Here is my project that got me a few interviews so far: https://github.com/damklis/DataEngineeringProject
-
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Apr 2024
Stats
damklis/DataEngineeringProject is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of DataEngineeringProject is Python.
Popular Comparisons
- DataEngineeringProject VS blinkist-scraper
- DataEngineeringProject VS synapse-s3-storage-provider
- DataEngineeringProject VS yaetos
- DataEngineeringProject VS amazon-s3-find-and-forget
- DataEngineeringProject VS Zillow-Data-Engineering
- DataEngineeringProject VS openwisp-monitoring
- DataEngineeringProject VS openverse-catalog
- DataEngineeringProject VS datajob
- DataEngineeringProject VS cryptostore
- DataEngineeringProject VS AdvancedSQLPuzzles
Sponsored