DataEngineeringProject
AdvancedSQLPuzzles
Metric | DataEngineeringProject | AdvancedSQLPuzzles
---|---|---
Mentions | 5 | 16
Stars | 985 | 499
Growth | - | -
Activity | 0.0 | 9.2
Last commit | over 1 year ago | about 1 month ago
Language | Python | TSQL
License | MIT License | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DataEngineeringProject
- What are your favourite GitHub repos that show how data engineering should be done?
- Is it me or are beginner-friendly ETL pipeline guides that explain from the ground up how to incorporate the use of various technologies notoriously difficult to find?
- Starting A Data Engineering Project Series
  News RSS Feeds
- 5 Data Sources for Data Engineering Projects
  Lastly, the most readily available data source would be data scraped from the internet. To be slightly less vague, I have outlined a project that web-scrapes new online articles every ten minutes to provide all the latest news curated into one place. This project utilizes a wide variety of relevant data engineering tools, which makes it a great project example. The author of this project is Damian Kliś, and he outlines his model architecture below:
- Can You Recommend Good Data Engineering Projects
  Here is my project that got me a few interviews so far: https://github.com/damklis/DataEngineeringProject
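
The mentions above describe the project as polling news RSS feeds every ten minutes and collecting new articles in one place. A minimal sketch of that idea using only the Python standard library might look like the following. This is not the project's actual code: the function names (`parse_rss_items`, `keep_new`, `fetch`, `poll_forever`) are hypothetical, the feeds are assumed to be plain RSS 2.0, and deduplication by link is an assumption made for illustration.

```python
import time
import urllib.request
import xml.etree.ElementTree as ET

POLL_SECONDS = 600  # ten minutes, matching the interval mentioned above


def parse_rss_items(xml_text):
    """Return a list of {"title", "link"} dicts from an RSS 2.0 document."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
        })
    return items


def keep_new(items, seen_links):
    """Return only items whose link is not in seen_links, and record them."""
    fresh = []
    for item in items:
        if item["link"] and item["link"] not in seen_links:
            seen_links.add(item["link"])
            fresh.append(item)
    return fresh


def fetch(url, timeout=10):
    """Download a feed and decode it as text (UTF-8 assumed)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def poll_forever(feed_urls, handle):
    """Poll every feed, pass each unseen article to handle(), then sleep."""
    seen = set()
    while True:
        for url in feed_urls:
            try:
                for item in keep_new(parse_rss_items(fetch(url)), seen):
                    handle(item)
            except (OSError, ET.ParseError) as exc:
                # Network and parse errors skip this feed until the next cycle.
                print(f"skipping {url}: {exc}")
        time.sleep(POLL_SECONDS)
```

Calling `poll_forever(feed_urls, print)` with a list of feed URLs of your choice would print each new article once. A production pipeline would typically replace the in-memory `seen` set with a persistent store and run the polling step under a scheduler.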
AdvancedSQLPuzzles
- Not using window functions?
  Try advancedsqlpuzzles.com
- I don't see the point in these problems.
- Interview at Square for Data Engineering
- How can I improve my SQL skills if I don't use it in work?
- How do I count the number of series per team?
- Where can I find exercises to practice my SQL learning?
  https://advancedsqlpuzzles.com/, Hackerrank.com, Leetcode.com, Datalemur.com
- I'm spending a crazy amount of time on HackerRank questions. Is this normal?
- Looking for more SQL exercises after finishing all the free medium and hard questions on StrataScratch
  Try this GitHub repository. It has a PDF of 50+ advanced puzzles that are nicely written, along with the solutions in a separate SQL script.
- Looking for advanced SQL resources
- SQL Case Studies to Practice
What are some alternatives?
blinkist-scraper - 📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
sp_whoisactive - sp_whoisactive
synapse-s3-storage-provider - Synapse storage provider to fetch and store media in Amazon S3
sql-server-password-secure - Securely storing passwords in SQL-Server for C# and verifying
yaetos - Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified
DataAccessGeneration - Better SQL Server stored procedure calls from C#
amazon-s3-find-and-forget - Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
MachineIntelligence-TextAnalytics-TPLDataFlows - Machine Intelligence using OpenAI, Semantic Kernel, Vector Search, SQL Server
Zillow-Data-Engineering
sql-server-maintenance-solution - SQL Server Maintenance Solution
openwisp-monitoring - Network monitoring system written in Python and Django, designed to be extensible, programmable, scalable and easy to use by end users: once the system is configured, monitoring checks, alerts and metric collection happens automatically.
dp-300-database-administrator - Repository for lab exercises and instructions for Microsoft DP-300 learning content