The Simplest Data Architecture

This page summarizes the projects mentioned and recommended in the original post on dev.to.

  1. Vagrant

    Vagrant is a tool for building and distributing development environments.

    I do believe that using containers makes a ton of sense when writing data pipelines. You can use the same image to develop and run the pipeline, preventing "it works on my machine" issues. You can test different variations of the image without having to stand up additional infrastructure or risk breaking the workflows of others who are using the same infrastructure. Finally, knowledge of containerization is increasingly expected of all engineers, while knowledge of other tools that solve similar issues (like Vagrant or Ansible) is less common.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

  3. Ansible

    Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.

    I do believe that using containers makes a ton of sense when writing data pipelines. You can use the same image to develop and run the pipeline, preventing "it works on my machine" issues. You can test different variations of the image without having to stand up additional infrastructure or risk breaking the workflows of others who are using the same infrastructure. Finally, knowledge of containerization is increasingly expected of all engineers, while knowledge of other tools that solve similar issues (like Vagrant or Ansible) is less common.

  4. Poetry

    Python packaging and dependency management made easy

    Tools like Poetry aim to fix this problem, but vanilla pip can do just fine.

  5. opentofu

    OpenTofu lets you declaratively manage your cloud infrastructure.

    Due to Terraform changing its license in 2023, a truly open-source fork called OpenTofu was created. Though I'll keep using the term "Terraform" below to prevent confusion, I do recommend OpenTofu over Terraform in your implementation.

  6. docs.getdbt.com

    The code behind docs.getdbt.com

    You can learn more about dbt by browsing its docs. You can also browse a real-life example: GitLab's dbt project.
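
The quoted point about using the same image to develop and run a pipeline can be illustrated with a small sketch. It only builds the `docker run` command lines and does not execute anything. The image tag `my-pipeline:1.0`, the host path `/home/me/pipeline`, the `/app` mount point, and the script name `run_pipeline.py` are all hypothetical and do not come from the original post.

```python
import shlex

# Hypothetical image tag; the same tag is used for development and for real runs.
IMAGE = "my-pipeline:1.0"


def docker_run_args(image, command, mount_source=None, interactive=False):
    """Build the argument list for `docker run`. Nothing is executed here."""
    args = ["docker", "run", "--rm"]
    if interactive:
        # Interactive terminal, useful for a development shell.
        args.append("-it")
    if mount_source is not None:
        # Bind-mount local source code into the container (assumed path /app).
        args += ["-v", f"{mount_source}:/app"]
    args.append(image)
    args += command
    return args


# Development: interactive shell with the local checkout mounted.
dev = docker_run_args(
    IMAGE, ["bash"], mount_source="/home/me/pipeline", interactive=True
)

# Scheduled run: same image, no mount, runs the (hypothetical) pipeline script.
prod = docker_run_args(IMAGE, ["python", "/app/run_pipeline.py"])

print(shlex.join(dev))
print(shlex.join(prod))
```

Because both commands reference the same image tag, whatever is installed in the development shell is also what the scheduled run sees, which is the "it works on my machine" protection the quote describes.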

Related posts

  • Good Resources For Learning Intermediate and Advanced Linux Skills?

    2 projects | /r/debian | 7 Jan 2023
  • How do you keep Python configurations in sync on more than one system?

    5 projects | /r/Python | 1 Jun 2022
  • Git credentials in a shared virtual machine

    2 projects | /r/git | 14 May 2022
  • A Disposable Local Test Environment is Essential for DevOps / SysAdmin

    5 projects | dev.to | 15 Feb 2022
  • How to test your Ansible playbook with Vagrant

    3 projects | dev.to | 20 Sep 2021
