FastMJPG vs sqllineage
Metric | FastMJPG | sqllineage
---|---|---
Mentions | 3 | 3
Stars | 172 | 1,148
Growth | - | -
Activity | 7.6 | 8.6
Last commit | 30 days ago | 17 days ago
Language | C | Python
License | GNU General Public License v3.0 only | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
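The site's actual formula is not given here, so the following is only a minimal sketch of how such a rating could work. It assumes an exponential recency weight with a 30-day half-life and a percentile rank scaled to 0-10. The function names `weighted_commit_score` and `activity_rating`, and the half-life value, are hypothetical.

```python
from bisect import bisect_left


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights so that recent commits count more.

    Assumption: a commit loses half of its weight every `half_life_days`.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_rating(raw_score, all_raw_scores):
    """Map a raw score to a 0-10 rating by percentile rank.

    The rating is 10 times the fraction of tracked projects whose raw
    score is strictly lower than `raw_score`.
    """
    ranked = sorted(all_raw_scores)
    below = bisect_left(ranked, raw_score)  # count of strictly lower scores
    return round(10.0 * below / len(ranked), 1)


# Hypothetical data: 100 tracked projects with raw scores 0..99.
tracked = list(range(100))
print(activity_rating(90, tracked))  # 90 of 100 projects score lower
```

Under these assumptions, a project that outranks 90% of tracked projects gets a rating of 9.0, which matches the "top 10%" reading given above.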
- FLaNK Stack Weekly for 12 September 2023
- Dependency Lineage & Scripting
  For open source, there is this library: https://github.com/reata/sqllineage.
- Launch HN: Elementary (YC W22) – Open-source data observability
  Is the idea here that it's inspired by re_data because it uses dbt transformations underneath, or because it's reposted looking nearly the same? (Or both?)

  Looks like much of the lineage code is also largely a wrapper around this library: https://github.com/reata/sqllineage

  I would be curious to understand the project's purpose and unique contributions versus the underlying dependencies powering it, as there seems to be some ambiguity. Is this just a wrapper around dbt transformations and a lineage library in one package? Can I just use them directly?
What are some alternatives?
bedframe - Your Browser Extension Development Framework
elementary - The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
dspy - DSPy: The framework for programming—not prompting—foundation models
re_data - re_data - fix data issues before your users & CEO would discover them 😊
open-interpreter - A natural language interface for computers
dbt-data-reliability - dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
machine_learning_games - Set of games and simulations designed to experiment with QLearning, Neuroevolution, and PoseNet.
deequ - Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
FLaNK-HuggingFace-BLOOM-LLM - https://huggingface.co/bigscience/bloom into NiFi
hrequests - 🚀 Web scraping for humans
hstream - HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.