itachi
sqlglot
itachi | sqlglot | |
---|---|---|
2 | 56 | |
54 | 5,511 | |
- | - | |
4.3 | 9.9 | |
8 months ago | 7 days ago | |
Scala | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
itachi
-
Tips for building popular open source data engineering projects
Feel free to ping me if you'd like me to help vet your ideas or market them. I'm happy to write blog posts and try to help you get users (some ppl hate blogging). itachi is an example of a genius project that I'm helping Kent market (he's a Spark PMC). The project is genius and it just needs exposure to get users.
-
Scala Spark vs Python PySpark: Which is better?
I think I understand you now. The functions in itachi are examples of what you call custom expressions. You're saying that custom expressions can only be defined in Scala, not in Python, so that's a Scala advantage, right?
sqlglot
-
The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol
This is probably referring to "zero changes to your driver code" and not "zero changes to the SQL you send over this driver".
Translating between SQL dialects is notoriously hard and attempts to translate [1] are working in 95% of cases. But the last 5% would require 5x amount of work. That's because "SQL dialect" also includes weird edge cases of type inference of things like COALESCE(5, FALSE) and emulation of system catalogs (pg_catalog, information_schema).
[1] https://github.com/tobymao/sqlglot
- FLaNK AI Weekly 18 March 2024
- SQLGlot: No-dependency SQL parser, transpiler, optimizer for 21 SQL dialects
-
Transpile Any SQL to PostgreSQL Dialect
Recommend checking out https://github.com/tobymao/sqlglot if you are interested in this capability for other SQL dialects
Tools like this are helpful for:
- Rendering SQL in a consistent way, eg for snapshot testing
-
This Week In Python
sqlglot – Python SQL Parser and Transpiler
- SQLglot: Python SQL Parser and Transpiler
-
Build the dependency graph of your BigQuery pipelines at no cost: a Python implementation
In the project we used Python lib networkx and a DiGraph object (Direct Graph). To detect a table reference in a Query, we use sqlglot, a SQL parser (among other things) that works well with Bigquery.
- A Primer on SQLGlot's Abstract Syntax Tree
-
Show HN: SQL Polyglot
Cool! Is this built with sqlglot[1] on the back end?
[1] https://github.com/tobymao/sqlglot
-
sqlglot - Amazing SQL parsing library
Wanted to give sqlglot a shoutout as it saved me a ton of time.
What are some alternatives?
cube.js - 📊 Cube — The Semantic Layer for Building Data Applications
sqloxide - Python bindings for sqlparser-rs
Quill - Compile-time Language Integrated Queries for Scala
JSqlParser - JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern
chispa - PySpark test helper methods with beautiful error messages
Transcrypt - Python 3.9 to JavaScript compiler - Lean, fast, open! -
zetasql - ZetaSQL - Analyzer Framework for SQL
duckdb - DuckDB is an in-process SQL OLAP Database Management System
criterion.rs - Statistics-driven benchmarking library for Rust
py2many - Transpiler of Python to many other languages
sirix - SirixDB is an an embeddable, bitemporal, append-only database system and event store, storing immutable lightweight snapshots. It keeps the full history of each resource. Every commit stores a space-efficient snapshot through structural sharing. It is log-structured and never overwrites data. SirixDB uses a novel page-level versioning approach.
splink - Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends