Our great sponsors
-
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
System design interviews can definitely be tough especially if weren't expecting one or haven't had one before. I'd recommend you check out this system design interview GitHub repo, it'll help you get the basics down for system design interviews.
If you're looking to better understand execution plans for spark I'd start by reading Spark the Definitive Guide, it's a bit higher level and broader than I'd like but will give you a good overview of spark's design. If you don't want to get a book this is repo does a good job breaking down how spark develops physical and logical plans https://github.com/JerryLead/SparkInternals.
One place to look are the projects repo's and docs, once you have a good idea of how the system is architected poking around pieces of the codebase can be helpful in letting you really understand their internals. I personally enjoy going through spark repo and trino repo and the documentation for both projects is decent and can answer many of your questions.
One place to look are the projects repo's and docs, once you have a good idea of how the system is architected poking around pieces of the codebase can be helpful in letting you really understand their internals. I personally enjoy going through spark repo and trino repo and the documentation for both projects is decent and can answer many of your questions.