Built and automated a complete end-to-end ELT pipeline using AWS, Airflow, dbt, Terraform, Metabase and more as a beginner project!

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • terraform

    Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared amongst team members, treated as code, edited, reviewed, and versioned.

    Thanks! I configured my AWS account credentials through the AWS CLI first. Then I used Terraform to create all the services on AWS. You can find all the files relating to setting up the cloud environment under `terraform/`. You can create GCP resources through Terraform as well. All the resources used in the project come under the AWS free tier, so I didn't incur any costs throughout the entire project!

  • audiophile-e2e-pipeline

    Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • pydantic

    Data validation using Python type hints

    Initial data parsing and validation through Pydantic to generate silver data.

  • Airflow

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

    Infrastructure provisioning through Terraform, containerized through Docker and orchestrated through Airflow. Created dashboard through Metabase.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts