how to get into data eng pt.2

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering.

  • superset

    Apache Superset is a Data Visualization and Data Exploration Platform

    There are other ways for people to consume data, and one of the major ones (maybe the major one?) is dashboards. You're going to have to give people some dashboarding solution because they want to fill the world with pie charts and you don't have the power to stop them. I've not worked with it much, but Superset seems alright.

  • Apache Hive

    Apache Hive

    How to flesh this idea out more? Start ingesting different types of events and putting them in different tables. Definitely try an OLAP solution (Hive? Most of my experience sits with closed-source projects that only a company can afford) and think about column-optimised file formats. Get the raw data into the OLAP solution, then compute aggregations from it and load them into another system.
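
The flow described above (route each event type to its own raw table, then derive an aggregate table from the raw data) can be sketched roughly as follows. This is a minimal illustration, not the original poster's setup: it uses Python's built-in sqlite3 module as a stand-in for an OLAP engine such as Hive, and the event types, field names, and table names are all hypothetical. SQLite is row-oriented, so this shows only the table layout and the aggregation step, not the benefits of a column-optimised file format.

```python
import sqlite3

# Hypothetical raw events; the types and fields are invented for illustration.
RAW_EVENTS = [
    {"type": "page_view", "user_id": 1, "url": "/home", "ts": "2024-01-01T10:00:00"},
    {"type": "page_view", "user_id": 2, "url": "/home", "ts": "2024-01-01T10:05:00"},
    {"type": "page_view", "user_id": 1, "url": "/pricing", "ts": "2024-01-02T09:00:00"},
    {"type": "purchase", "user_id": 1, "amount_cents": 1999, "ts": "2024-01-02T09:10:00"},
    {"type": "purchase", "user_id": 2, "amount_cents": 500, "ts": "2024-01-02T11:00:00"},
]


def load_raw(conn, events):
    """Create one raw table per event type and route each event to its table."""
    conn.execute("CREATE TABLE page_views (user_id INTEGER, url TEXT, ts TEXT)")
    conn.execute("CREATE TABLE purchases (user_id INTEGER, amount_cents INTEGER, ts TEXT)")
    for event in events:
        if event["type"] == "page_view":
            conn.execute(
                "INSERT INTO page_views VALUES (?, ?, ?)",
                (event["user_id"], event["url"], event["ts"]),
            )
        elif event["type"] == "purchase":
            conn.execute(
                "INSERT INTO purchases VALUES (?, ?, ?)",
                (event["user_id"], event["amount_cents"], event["ts"]),
            )
    conn.commit()


def build_daily_revenue(conn):
    """Aggregate raw purchases into a per-day summary table and return its rows."""
    conn.execute(
        """
        CREATE TABLE daily_revenue AS
        SELECT substr(ts, 1, 10) AS day,
               COUNT(*) AS orders,
               SUM(amount_cents) AS revenue_cents
        FROM purchases
        GROUP BY substr(ts, 1, 10)
        """
    )
    conn.commit()
    return conn.execute(
        "SELECT day, orders, revenue_cents FROM daily_revenue ORDER BY day"
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")  # in-memory database, nothing is persisted
    load_raw(connection, RAW_EVENTS)
    for row in build_daily_revenue(connection):
        print(row)
```

In a real setup the summary table would typically be exported to whatever system serves the dashboards, so that dashboard queries do not hit the raw event tables directly.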
