| | grai-core | sqlparse |
|---|---|---|
| Mentions | 6 | 7 |
| Stars | 269 | 3,581 |
| Growth | 2.2% | - |
| Activity | 9.5 | 8.2 |
| Last commit | 4 days ago | 6 days ago |
| Language | Python | Python |
| License | GNU General Public License v3.0 or later | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
grai-core
-
Launch HN: Grai (YC S22) – Open-Source Data Observability Platform
Elastic v2 if one is interested in such things: https://github.com/grai-io/grai-core/blob/v0.1.33/LICENSE
-
Standalone lineage tool
I’m not sure this is precisely what you’re looking for, but Grai might serve your needs. The backend data model lets you push any arbitrary metadata you want or need onto the lineage graph and retrieve it through either the REST or graph API. I’m one of the authors, so I’m happy to answer any questions you might have.
-
Data Load Diagram
We've been looking at building something like this for Grai specifically to support Airflow but haven't yet prioritized it.
-
Grai, a self-hosted data lineage tool. Test downstream impact of data migration changes
We were frustrated because although we had tests in our data warehouse, they only notified us after an outage occurred. What we needed was a way to detect changes during CI/CD, so we could fix things before they impacted production. So we developed Grai, an open-source data lineage toolkit with pre-built integrations for the most common data stores, designed to work with CI tools like GitHub Actions.
sqlparse
-
Show HN: Databasediagram.com – Private, Text to Entity-Relationship Diagram Tool
Suggest checking out the sqlparse library for a way to do the different flavours without needing to address each case directly: https://github.com/andialbrecht/sqlparse
-
Data Load Diagram
Gotcha. Since we haven't actually written all of this yet, I don't have any useful code snippets to share, but we've discussed tackling the problem internally using something like sqlparse. You'd need to identify the relevant SQL chunks, parse them for table dependency information, and then create the relevant entities in whichever data lineage tool you're using.
-
This Week In Python
sqlparse – A non-validating SQL parser module for Python
-
Open Source SQL Parsers
Regular expressions are a popular way to extract information from SQL statements. However, they quickly become too complex when they have to handle common features like WITH clauses, sub-queries, window clauses, aliases and quotes. sqlparse is a popular Python package that uses regular expressions to parse SQL.
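As a small illustration of that point, here is a naive pattern (written for this example, not taken from any library) that grabs the identifier after `FROM` or `JOIN`. It works on a simple query but misreports a CTE name and a word inside a string literal as tables.

```python
import re

# Naive pattern: capture the identifier that follows FROM or JOIN.
TABLE_RE = re.compile(r"\b(?:FROM|JOIN)\s+([A-Za-z_][\w.]*)", re.IGNORECASE)


def naive_tables(sql):
    """Return every identifier the naive pattern takes for a table name."""
    return TABLE_RE.findall(sql)


simple = "SELECT id FROM orders JOIN customers ON orders.cid = customers.id"
with_cte = "WITH recent AS (SELECT id FROM orders) SELECT id FROM recent"
with_literal = "SELECT 'copied from backup' AS note FROM orders"

print(naive_tables(simple))        # ['orders', 'customers']
print(naive_tables(with_cte))      # ['orders', 'recent']  ('recent' is a CTE, not a table)
print(naive_tables(with_literal))  # ['backup', 'orders']  ('backup' is inside a string)
```

Each of these cases can be patched with a more elaborate pattern, but the patches pile up quickly, which is the complexity the paragraph above refers to.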
-
Automated SQL formatting checks
This one is not bad: https://github.com/andialbrecht/sqlparse.
-
Let's write a compiler, part 5: A code generator
-
BigQuery Lineage
We used this repo for this: https://github.com/andialbrecht/sqlparse. I may have miscommunicated. We didn't write the parser from scratch; we created a way for the parser to detect the downstream and upstream dependencies of a resource.
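The comment does not say how that detection was implemented. One plausible shape, sketched here with hypothetical resource names and a hypothetical `build_downstream` helper, is to record the upstream tables parsed from each resource's SQL and then invert that mapping to get downstream dependents.

```python
from collections import defaultdict


def build_downstream(upstream):
    """Invert {resource: upstream tables} into {table: downstream resources}."""
    downstream = defaultdict(set)
    for resource, sources in upstream.items():
        for source in sources:
            downstream[source].add(resource)
    return dict(downstream)


# Hypothetical input: upstream tables already extracted from each resource's SQL.
upstream = {
    "reporting.daily_sales": {"raw.orders", "raw.customers"},
    "reporting.top_customers": {"reporting.daily_sales"},
}

downstream = build_downstream(upstream)
print(downstream["raw.orders"])             # {'reporting.daily_sales'}
print(downstream["reporting.daily_sales"])  # {'reporting.top_customers'}
```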
What are some alternatives?
dbt-snowflake-monitoring - A dbt package from SELECT to help you monitor Snowflake performance and costs
zetasql - ZetaSQL - Analyzer Framework for SQL
awesome-data-catalogs - 📙 Awesome Data Catalogs and Observability Platforms.
pyparsing - Python library for creating PEG parsers [Moved to: https://github.com/pyparsing/pyparsing]
jupysql - Better SQL in Jupyter. 📊
Lark - Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
MindsDB - The platform for customizing AI from enterprise data
PLY - Python Lex-Yacc
django-pgschemas - Django multi-tenancy through Postgres schemas
sqlfluff - A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
ibis - the portable Python dataframe library
Pygments