Architecture suggestions for project

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering.

  • r-place-2022

  • https://github.com/rv02/r-place-2022/blob/main/architecture.drawio.png A general overview of the pipeline and analysis for the recent r/place event data. Some suggestions from a random redditor: https://www.reddit.com/r/place/comments/txvk2d/comment/i3ogrbt/?utm_source=share&utm_medium=web2x&context=3 I got some basics covered at DE Zoomcamp recently. They relied heavily on Google Cloud, but I think I want to go with AWS, and also practice IaC with Terraform. I am thinking of using Spark for the analysis, or should I opt for dbt? Also, as a student I don't want to run up a big bill; the zipped files are around 11 GB. Would GCP be cheaper?

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
