Why do companies still build data ingestion tooling instead of using a third-party tool like Airbyte?

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • getting-started

    This repository is a getting started guide to Singer. (by singer-io)

  • Coincidently, I saw a presentation today on a nice half-way-house solution: using embeddable Python libraries like Sling and dlt - both open-source. See https://www.youtube.com/watch?v=gAqOLgG2iYY There is also singer.io which is more of a protocol than a library, but can also be installed although it looks like it is a true community effort and not so well maintained.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Design patter for Python ETL

    2 projects | /r/dataengineering | 2 Dec 2022
  • Basic data engineering question.

    2 projects | /r/dataengineering | 16 Oct 2022
  • I have hundreds of API data endpoints with different schemas. How do I organize?

    1 project | /r/dataengineering | 10 Oct 2022
  • CDC (Change Data Capture) with 3rd party APIs

    1 project | /r/dataengineering | 23 Sep 2022
  • Questions about Integration Singer Specification with AWS Glue

    1 project | /r/dataengineering | 26 Aug 2022