sqllineage vs flink-cdc
Metric | sqllineage | flink-cdc
---|---|---
Mentions | 3 | 4
Stars | 1,135 | 5,261
Growth | - | 2.0%
Activity | 8.6 | 9.6
Last commit | 1 day ago | 5 days ago
Language | Python | Java
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sqllineage
- FLaNK Stack Weekly for 12 September 2023
- Dependency Lineage & Scripting
  For the open source there is this library https://github.com/reata/sqllineage.
- Launch HN: Elementary (YC W22) – Open-source data observability
  Is the idea here that it's inspired by re_data due to using dbt transformations underneath or because it's reposted looking nearly the same? (or both?)
  Looks like much of the lineage code is also largely a wrapper around this library: https://github.com/reata/sqllineage
  Would be curious to understand the project's purpose and unique contributions vs. the underlying dependencies powering it as there seems to be some ambiguity. Is this just a wrapper around dbt transformations and a lineage library in one package? Can I just use them directly?
flink-cdc
- FLaNK Stack Weekly 12 February 2024
- FLaNK Stack Weekly for 12 September 2023
- Flink CDC / alternatives
  Info from: https://github.com/ververica/flink-cdc-connectors
- Flink CDC Connectors
What are some alternatives?
elementary - The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
debezium - Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
re_data - Fix data issues before your users & CEO would discover them 😊
open-interpreter - A natural language interface for computers
dbt-data-reliability - dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
flink-faker - A data generator source connector for Flink SQL based on data-faker.
deequ - Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
FLaNK-Halifax - Community over Code, Apache NiFi, Apache Kafka, Apache Flink, Python, GTFS, Transit, Open Source, Open Data
hrequests - 🚀 Web scraping for humans
MmFLaNK - Mm FLaNK Stack (MXNet, MiNiFi, Flink, NiFi, Kafka, Kudu) for AI-IoT