AdvancedSQLPuzzles vs DataEngineeringProject
Metric | AdvancedSQLPuzzles | DataEngineeringProject
---|---|---
Mentions | 16 | 5
Stars | 490 | 985
Growth | - | -
Activity | 9.2 | 0.0
Last commit | about 1 month ago | over 1 year ago
Language | TSQL | Python
License | - | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
AdvancedSQLPuzzles
- Not using window functions?
  Try advancedsqlpuzzles.com
- I don't see the point in these problems.
- Interview at Square for Data Engineering
- How can I improve my SQL skills if I don't use it in work?
- How do I count the number of series per team?
- Where can I find exercises to practice my SQL learning?
  https://advancedsqlpuzzles.com/, Hackerrank.com, Leetcode.com, Datalemur.com
- I'm spending a crazy amount of time on hacker rank questions. Is this normal?
- Looking for more SQL exercises after finishing all the free medium and hard questions on stratascratch
  Try this GitHub repository. It has a PDF of 50+ advanced puzzles that are nicely written, along with the solutions in a separate sql script.
- Looking for advanced SQL resources
- SQL Case Studies to Practice
DataEngineeringProject
- What are your favourite GitHub repos that shows how data engineering should be done?
- Is it me or are beginner-friendly ETL pipeline guides that explain from the ground-up how to incorporate the use of various technologies notoriously difficult to find.
- Starting A Data Engineering Project Series
  News RSS Feeds
- 5 Data Sources for Data Engineering Projects
  Lastly, the most readily available data source would be data scraped from the internet. To be slightly less vague, I have outlined a project that web-scrapes new online articles every ten minutes to provide all the latest news curated into one place. This project utilizes a wide variety of relevant data engineering tools, which makes it a great project example. The author of this project is Damian Kliś, and he outlines his model architecture below:
- Can You Recommend Good Data Engineering Projects
  Here is my project that got me a few interviews so far: https://github.com/damklis/DataEngineeringProject
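The mentions above describe DataEngineeringProject as collecting news articles from RSS feeds on a ten-minute schedule. As a rough illustration of that idea only, and not the project's actual code, the sketch below parses an RSS document with Python's standard library and keeps only items not seen on an earlier poll. The function names (`parse_rss_items`, `keep_new`), the sample feed, and the in-memory `seen` set are all assumptions made for this example; fetching over the network and scheduling are left out.

```python
import xml.etree.ElementTree as ET


def parse_rss_items(rss_xml: str) -> list[dict]:
    """Extract title and link from every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
        })
    return items


def keep_new(items: list[dict], seen_links: set[str]) -> list[dict]:
    """Return items whose link has not been seen before, and remember them."""
    fresh = []
    for entry in items:
        if entry["link"] and entry["link"] not in seen_links:
            seen_links.add(entry["link"])
            fresh.append(entry)
    return fresh


# Hypothetical feed content standing in for a real HTTP response.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0"><channel><title>Example</title>
<item><title>First story</title><link>https://example.com/a</link></item>
<item><title>Second story</title><link>https://example.com/b</link></item>
</channel></rss>"""

seen: set[str] = set()
first_poll = keep_new(parse_rss_items(SAMPLE_FEED), seen)
second_poll = keep_new(parse_rss_items(SAMPLE_FEED), seen)  # same feed again
print(len(first_poll), len(second_poll))
```

Polling the same feed twice yields new items only the first time, which is the property a periodic scraper needs to avoid storing duplicates. A real pipeline would likely persist the seen links in a database rather than in memory.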
What are some alternatives?
sp_whoisactive - sp_whoisactive
blinkist-scraper - 📚 Python tool to download book summaries and audio from Blinkist.com, and generate some pretty output
sql-server-password-secure - Securely storing passwords in SQL-Server for C# and verifying
synapse-s3-storage-provider - Synapse storage provider to fetch and store media in Amazon S3
DataAccessGeneration - Better SQL Server stored procedure calls from C#
yaetos - Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified
MachineIntelligence-TextAnalytics-TPLDataFlows - Machine Intelligence using OpenAI, Semantic Kernel, Vector Search, SQL Server
amazon-s3-find-and-forget - Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
sql-server-maintenance-solution - SQL Server Maintenance Solution
Zillow-Data-Engineering
dp-300-database-administrator - Repository for lab exercises and instructions for Microsoft DP-300 learning content
openwisp-monitoring - Network monitoring system written in Python and Django, designed to be extensible, programmable, scalable and easy to use by end users: once the system is configured, monitoring checks, alerts and metric collection happens automatically.