Whats something hot rn or whats going to be next thing we should focus on in data engineering?

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • materialize

    The data warehouse for operational workloads. (by MaterializeInc)

  • In terms of cutting edge tech, I'd say the companies in the data streaming space are doing some pretty cool stuff e.g https://materialize.com/

  • monosi

    Open source data observability platform

  • Ah ok cool, well I guess you can say a lot of these tools that are becoming big with the modern data stack provide some form of automation. E.g Fivetran / Airbyte extract data on an automated schedule, then you have dbt with the transformations, and then the reverse ETLs like Hightouch / Census that run on an automated a schedule as well. I think it's pretty much becoming somewhat of a standard now to have, e.g. with what I'm building we included a scheduler for automation from the start.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • ploomber

    The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

  • Ploomber :) We launched on HN yesterday!

  • projects

    Sample projects using Ploomber. (by ploomber)

  • Yes! (tell your friend). You can write shell scripts so you can execute that 2002 code :) You can test it locally and then run it in AWS Batch/Argo. Here's an example

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts