|18 days ago||4 days ago|
|Apache License 2.0||MIT License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Mangle, a programming language for deductive database programming
12 projects | news.ycombinator.com | 26 Nov 2022
There are even table-valued functions.
These things are not widespread, and differ by implementation, and the way these are used by clients is copy-and-paste. Something as thoughtful as ZetaSQL https://github.com/google/zetasql does not have mechanisms for structuring (modules, packages, interfaces). SQL will not, cannot evolve into such a direction (or, anything that evolves, will not be recognizable as SQL).
goccy/bigquery-emulator: BigQuery emulator server implemented in Go
2 projects | /r/bigquery | 17 Oct 2022
Hi, I have been developing a BigQuery emulator ( https://github.com/goccy/bigquery-emulator ) since early 2022. It is written in Go, but can be used from the bq command line tool and from client SDKs in other languages (e.g. Python) after installing the Docker image or a released binary. It currently supports over 200 of the nearly 330 standard functions in BigQuery and all data types except GEOGRAPHY ( see https://github.com/goccy/go-zetasqlite#status for details ). ZetaSQL ( https://github.com/google/zetasql ) is used to parse and analyze queries.
7 projects | news.ycombinator.com | 22 Aug 2022
If you don't want to do it yourself, there's this:
Parsing is huge but it's amazing how small a part of the job it is. This library isn't even the half of it.
SQLGlot: SQL parser, transpiler, optimizer – translate to Presto, Spark, Hive
9 projects | news.ycombinator.com | 4 Jul 2022
New PostgreSQL Interface for Cloud Spanner
2 projects | news.ycombinator.com | 12 Oct 2021
It is amazingly good.
You give it textual SQL (+ schema + all your function definitions) and it returns a really clean logical query plan. It is also happy to do this via protobufs so you can use it from languages other than C++. It is also tested and documented up the wazoo. It has been such a pleasure to work with.
Anyway, the big problem with ZetaSQL is that it is not a common SQL dialect.
It seems that the only reasonable way to do this PostgreSQL interface for Cloud Spanner is to add a second parser (and other extensions) to ZetaSQL. If I am correct, I really really hope they open source that part of ZetaSQL as well - it would be a massive step forward for open source SQL tooling.
2 projects | news.ycombinator.com | 12 Oct 2021
I mean the postgres parser (and semantic changes) for ZetaSQL. The zetasql parser is in a file called zetasql/parser/bison_parser.y, I strongly suspect they now have a file called something like zetasql/pgparser/bison_parser.y as well (and much more pervasive changes to support the deeper differences in the dialects).
This is the lexical structure and syntax docs for the new postgres interface to cloud spanner:
And this is the zetasql lexical structure and syntax docs:
Notice that the new PG docs are an edit of the Zeta ones - evidence that my hypothesis is correct.
Open Source SQL Parsers
17 projects | dev.to | 8 Oct 2021
zetasql implements BigQuery, Spanner, and Dataflow dialects.
Let's write a compiler, part 5: A code generator
14 projects | news.ycombinator.com | 19 Aug 2021
ZetaSQL seems like it could be a fit for your use case. I've worked with Apache Calcite in the past and found it to be very complex to work with. I found ZetaSQL to be a little easier to use.
BigQuery Language Server
2 projects | /r/bigquery | 6 Jul 2021
I’m not aware of one but you could probably use ZetaSQL to put one together, the difficult work has been opened, you’d just need to add the LSP layer.
A new template-defined width integer C++ library has snuck its way into Google ZetaSQL
2 projects | /r/coolgithubprojects | 9 Mar 2021
Some additional helper functions here.
The SQL Unit Testing Landscape: 2023
2 projects | /r/dataengineering | 3 May 2023
https://github.com/tobymao/sqlglot does allow you to execute Python against SQL, but window functions are not allowed…
How to manipulate SQL string programmatically?
2 projects | /r/dataengineering | 28 Apr 2023
Use a SQL Parser like sqlglot or Apache Calcite to compile user's query into an AST.
SQL: 2023 is finished: Here is what's new
3 projects | news.ycombinator.com | 13 Apr 2023
sqlglot might be what you are looking for.
Building Column Level Lineage for dbt
3 projects | /r/dataengineering | 8 Mar 2023
SQL server to postgreSQL migration
2 projects | /r/dotnet | 23 Feb 2023
SQLGlot might be a bit too rudimentary for your case, but might provide an excellent starting point. Note that it is written in Python.
Is it possible to set up a mini snowflake instance on a home server to load mock data sets and practice building a warehouse with SQL?
2 projects | /r/SQL | 3 Feb 2023
and use https://github.com/tobymao/sqlglot to transpile Snowflake SQL to Duck SQL
Writing a Python SQL engine from scratch
4 projects | news.ycombinator.com | 3 Jan 2023
Pure Python Distributed SQL Engine
9 projects | news.ycombinator.com | 30 Dec 2022
MySQL-mimic - Python implementation of the MySQL server wire protocol.
4 projects | /r/Python | 31 Oct 2022
Note - mysql-mimic is just the wire protocol. We use sqlglot to parse/optimize SQL.
"Query Oracle + Sql"
2 projects | /r/dataengineering | 17 Oct 2022
If you're talking about generating SQL, try https://github.com/tobymao/sqlglot. It allows you to define query logic using Python, and then generate Oracle and SQL Server (I assume with 'sql' you mean SQL Server -> try to use clear terminology please) compatible SQL.
What are some alternatives?
sqlparse - A non-validating SQL parser module for Python
sqloxide - Python bindings for sqlparser-rs
Apache Calcite - Apache Calcite
JSqlParser - JSqlParser parses an SQL statement and translates it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern
ANTLR - ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
pgsql-parser - PostgreSQL Query Parser for Node.js
duckdb - DuckDB is an in-process SQL OLAP Database Management System
Presto - The official home of the Presto distributed SQL query engine for big data