spyql
partiql-lang-kotlin
spyql | partiql-lang-kotlin | |
---|---|---|
23 | 6 | |
902 | 532 | |
- | 0.2% | |
0.0 | 9.3 | |
over 1 year ago | 3 days ago | |
Jupyter Notebook | Kotlin | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spyql
-
Fq: Jq for Binary Formats
I prefer a SQL-like format. It’s not as complete but it cover most of the day-to-day use cases. Take a look at https://github.com/dcmoura/spyql (I am the author). Congrats on fq!
-
Command-line data analytics made easy with SPyQL
SPyQL documentation: spyql.readthedocs.io
-
This Week In Python
spyql – Query data on the command line with SQL-like SELECTs powered by Python expressions
- Command-line data analytics made easy
-
Jc – JSONifies the output of many CLI tools
This is great!
I am the author of SPyQL [1]. Combining JC with SPyQL you can easily query the json output and run python commands on top of it from the command-line :-) You can do aggregations and so forth in a much simpler and intuitive way than with jq.
I just wrote a blogpost [2] that illustrates it. It is more focused on CSV, but the commands would be the same if you were working with JSON.
[1] https://github.com/dcmoura/spyql
- The fastest command-line tools for querying large JSON datasets
-
Working with more than 10gb csv
You can import the data into a PostgreSQL/MySQL/SQLite/... database and then query the database. However, even with the right choice of indexes, it might take a while to run queries on a table with hundreds of millions of records. You can easily import your data to these databases with SpyQL: $ spyql "SELECT * FROM csv TO sql(table=my_table_name) | sqlite3 my.db" (you would need to create the table my_table_name before running the command).
-
ClickHouse Cloud is now in Public Beta
https://github.com/dcmoura/spyql/blob/master/notebooks/json_...
And ClickHouse looks like a normal relational database - there is no need for multiple components for different tiers (like in Druid), no need for manual partitioning into "daily", "hourly" tables (like you do in Spark and Bigquery), no need for lambda architecture... It's refreshing how something can be both simple and fast.
- A SQLite extension for reading large files line-by-line
-
I want to convert a large JSON file into Tabular Format.
I thought this library was pretty nifty for json. It's also relatively fast compared to most json parsers: https://github.com/dcmoura/spyql
partiql-lang-kotlin
-
Amazon Ion Specification
Ion is heavily used on the retail side of Amazon, but it's only recently started to appear in AWS products.
AWS is starting support PartiQL (https://partiql.org/) queries in some places and PartiQL uses Ion's type system internally.
-
XTDB ‘Core2’ is an experimental, SQL-first, immutable database concept
Yep, that's exactly the vision here. One of the biggest sources of precedent and inspiration for us here is https://partiql.org/ which is picking up a fair bit of traction across AWS. Also see https://rockset.com/
-
What is a Quantum Ledger Database?
QLDB allows you to create a ledger that acts similar to a schema or table space in a traditional database. Once you've created this ledger, you create a table through the SQL-like query language PartiQL that also enables you to interact with the data.
-
Show HN: PRQL – A Proposal for a Better SQL
PartiQL[0] is an open source library that is a superset of SQL that I really like. It supports querying nested structures inside columns, so if a column contains some JSON data you can use the standard dot notation to query nested JSON data directly
[0] https://partiql.org/
-
DynamoDB with PartiQL
PartiQL was introduced to AWS DynamoDB, with AWS making the announcement in 2020 making the life of developers easier, with the comfort of executing commands similar to SQL.
-
Newcomer needs help with Dynamodb (PARTIQL)
I haven't tried it, but I'm pretty sure the problem is that timestamp (and it's various capitalizations) is a keyword in PartiQL (https://github.com/partiql/partiql-lang-kotlin/blob/master/lang/src/org/partiql/lang/syntax/LexerConstants.kt#L221). To get it to be interpreted as an attribute name, you need to enclose TimeStamp in double quotes.
What are some alternatives?
prql - PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
rfcs - RFCs for major changes to EdgeDB
malloy - Malloy is an experimental language for describing data relationships and transformations.
tresql - Shorthand SQL/JDBC wrapper language, providing nested results as JSON and more
Preql - An interpreted relational query language that compiles to SQL.
partiql-ir-generator - PartiQL I.R. Generator (P.I.G.)
prosto - Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
krangl - krangl is a {K}otlin DSL for data w{rangl}ing
pxi - 🧚 pxi (pixie) is a small, fast, and magical command-line data processor similar to jq, mlr, and awk.
logica - Logica is a logic programming language that compiles to SQL. It runs on Google BigQuery, PostgreSQL and SQLite.