Uber Interview Experience/Asking Suggestions

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • system-design-primer

    Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

  • System design interviews can definitely be tough especially if weren't expecting one or haven't had one before. I'd recommend you check out this system design interview GitHub repo, it'll help you get the basics down for system design interviews.

  • SparkInternals

    Notes talking about the design and implementation of Apache Spark

  • If you're looking to better understand execution plans for spark I'd start by reading Spark the Definitive Guide, it's a bit higher level and broader than I'd like but will give you a good overview of spark's design. If you don't want to get a book this is repo does a good job breaking down how spark develops physical and logical plans https://github.com/JerryLead/SparkInternals.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

  • One place to look are the projects repo's and docs, once you have a good idea of how the system is architected poking around pieces of the codebase can be helpful in letting you really understand their internals. I personally enjoy going through spark repo and trino repo and the documentation for both projects is decent and can answer many of your questions.

  • Trino

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

  • One place to look are the projects repo's and docs, once you have a good idea of how the system is architected poking around pieces of the codebase can be helpful in letting you really understand their internals. I personally enjoy going through spark repo and trino repo and the documentation for both projects is decent and can answer many of your questions.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts