Metric | data-lineage | sqllineage
---|---|---
Mentions | 1 | 3
Stars | 296 | 1,126
Growth | 1.0% | -
Activity | 2.0 | 8.6
Last commit | 9 months ago | 8 days ago
Language | Python | Python
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
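The exact formula behind the activity number is not given here, so the following is only a minimal sketch of how such a score could work. It assumes an exponential decay with a hypothetical `half_life_days` parameter to weight recent commits more heavily, and a percentile rank scaled to 0-10 so that 9.0 corresponds to the top 10% of tracked projects. The function names are illustrative and are not taken from the site.

```python
from bisect import bisect_left


def weighted_activity(commit_ages_days, half_life_days=30.0):
    """Sum commit weights; a commit's weight halves every `half_life_days`.

    Assumption: exponential decay is one plausible way to make recent
    commits count more than older ones.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(raw, all_raw):
    """Scale a raw activity value to 0-10 by percentile rank.

    The score is ten times the fraction of tracked projects whose raw
    value is strictly lower, so 9.0 means 90% of projects rank below.
    """
    ranked = sorted(all_raw)
    below = bisect_left(ranked, raw)
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    # Hypothetical data: 100 tracked projects with raw values 0..99.
    tracked = list(range(100))
    print(activity_score(90, tracked))
    print(weighted_activity([0, 30]))
```

Under these assumptions, a project that stops receiving commits drifts down the ranking over time even if its total commit count is high.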
data-lineage
- Column level lineage
Link: https://github.com/tokern/data-lineage
sqllineage
- FLaNK Stack Weekly for 12 September 2023
- Dependency Lineage & Scripting
For an open-source option, there is this library: https://github.com/reata/sqllineage.
- Launch HN: Elementary (YC W22) – Open-source data observability
Is the idea here that it's inspired by re_data due to using dbt transformations underneath or because it's reposted looking nearly the same? (or both?)
Looks like much of the lineage code is also largely a wrapper around this library: https://github.com/reata/sqllineage
Would be curious to understand the project's purpose and unique contributions vs. the underlying dependencies powering it as there seems to be some ambiguity. Is this just a wrapper around dbt transformations and a lineage library in one package? Can I just use them directly?
What are some alternatives?
lux - Automatically visualize your pandas dataframe via a single print! 📊 💡
elementary - The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
lux - 👾 Fast and simple video download library and CLI tool written in Go
re_data - re_data - fix data issues before your users & CEO would discover them 😊
ipython - Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
dbt-data-reliability - dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
deequ - Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
hrequests - 🚀 Web scraping for humans
open-interpreter - A natural language interface for computers
rivet - The open-source visual AI programming environment and TypeScript library
RecipeUI - Discover, test, and share APIs in seconds
bedframe - Your Browser Extension Development Framework