Looking for open source projects that use data pipelines and big data flows

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

  1. meltano

    I'm not really sure if this is what you are looking for, but take a look at Meltano.

  3. pipelinewise

    Data Pipeline Framework using the singer.io spec

  4. nifi

    Apache NiFi

    I think Apache NiFi is a good match for what you are trying to find. It's also open source, so you can contribute as well.

  5. Logstash

    Logstash - transport and process your logs, events, or other data

    Is Logstash the kind of project you are looking for? https://github.com/elastic/logstash

  6. jet-train

  7. electricitymaps-contrib

    The open source repository for Electricity Maps App and data parsers that enables a real-time visualisation of the CO2 emissions of electricity consumption

  8. grouparoo

    Discontinued 🦘 The Grouparoo Monorepo - open source customer data sync framework

    The product I'm working on called Grouparoo might be a good fit: https://github.com/grouparoo/grouparoo

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
