But is there a way to solve the schema problem? Yes: KCBQ, the Kafka Connect BigQuery sink connector.
Therefore, I still recommend using a streaming framework such as Apache Flink or Apache Kafka Streams.
We use Debezium to capture changes from each database and publish the change streams to Kafka; KCBQ then subscribes to those Kafka topics and archives the records to BigQuery.
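The KCBQ side of this pipeline is driven by a connector configuration submitted to the Kafka Connect REST API. Below is a minimal sketch: the connector name, topics, project, and dataset are placeholder values, and the option names follow the commonly documented settings of the WePay/Confluent BigQuery sink connector.

```python
import json


def kcbq_sink_config(topics, project, dataset):
    """Build a hypothetical KCBQ sink connector config for the
    Kafka Connect REST API. All values here are placeholders."""
    return {
        "name": "kcbq-sink",
        "config": {
            "connector.class": "com.wepay.kafka.connect.bigquery.BigQuerySinkConnector",
            "topics": ",".join(topics),
            "project": project,
            "defaultDataset": dataset,
            # Let the connector create tables and evolve BigQuery schemas
            # as the upstream (Debezium-captured) schemas change.
            "autoCreateTables": "true",
            "allowNewBigQueryFields": "true",
            "allowBigQueryRequiredFieldRelaxation": "true",
        },
    }


if __name__ == "__main__":
    cfg = kcbq_sink_config(["orders", "users"], "my-gcp-project", "warehouse")
    print(json.dumps(cfg, indent=2))
    # One would POST this JSON to the Kafka Connect REST endpoint, e.g.:
    #   curl -X POST -H "Content-Type: application/json" \
    #        -d @config.json http://connect:8083/connectors
```

Because the connector is allowed to create tables and add fields, the warehouse schema tracks upstream changes without manual intervention, which is exactly what makes KCBQ an answer to the schema problem.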
In addition, batch tasks require knowledge of each service's data schema in order to read the data correctly and save it to the corresponding warehouse table. Assuming our data warehouse is GCP BigQuery, the warehouse table schemas also need to be created and modified manually.
For batch processing, I recommend Apache Airflow: it is easy to operate, and DAGs are plain Python scripts, so you can cover a wide range of batch-processing scenarios with little effort.