Dsq: Commandline tool for running SQL queries against JSON, CSV, Parquet, etc.

roapi

Mentions: 24 | Stars: 3,070 | Activity: 6.9 | Language: Rust

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

Neat! I have also built a similar project in Rust https://github.com/roapi/roapi/tree/main/columnq-cli :)

q

Mentions: 46 | Stars: 10,109 | Activity: 3.6 | Language: Python

q - Run SQL directly on delimited files and multi-file sqlite databases (by harelba)

dsq references a benchmark done by q (https://github.com/harelba/q/blob/master/test/BENCHMARK.md) that indicates that octosql is significantly slower.
However, octosql's GH repo claims otherwise.
Does anyone have any real world experience that they can share on these tools?

go-duckdb

Mentions: 4 | Stars: 492 | Activity: 8.2 | Language: Go

go-duckdb provides a database/sql driver for the DuckDB database engine.

Yeah frankly the q benchmark isn't the best even though dsq compares favorably in it. It isn't well documented and exercises a very limited amount of functionality and isn't very rigorous from what I can see. That said, the caching q does is likely very solid (and not something dsq does).
The biggest risk I think with octosql (and cube2222 is here somewhere to disagree with me if I'm wrong) is that they have their own entire SQL engine whereas textql, q and dsq use SQLite. But q is also in Python whereas textql, octosql, and dsq are in Go.
In the next few weeks I'll be posting some benchmarks that I hope are a little fairer (or at least well-documented and reproducible). Though of course it would be appropriate to have independent benchmarks too since I now have a dog in the fight.
On a tangent, once the go-duckdb binding [0] matures I'd love to offer duckdb as an alternative engine flag within dsq (and DataStation). Would be neat to see.
[0] https://github.com/marcboeker/go-duckdb
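
The comment above notes that textql, q and dsq hand the actual querying to SQLite rather than shipping their own SQL engine. As a rough illustration of that general approach (not the code of any of these tools), here is a minimal Python sketch that loads CSV text into an in-memory SQLite table and runs a query against it. The function name `query_csv`, the table name `t`, and the sample data are all hypothetical.

```python
import csv
import io
import sqlite3


def query_csv(csv_text, sql, table="t"):
    """Load CSV text into an in-memory SQLite table, then run one SQL query.

    Hypothetical helper for illustration only; real tools handle types,
    large files, and many more input formats.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    conn = sqlite3.connect(":memory:")
    try:
        # Quote column names so headers containing spaces still work.
        cols = ", ".join('"' + name.replace('"', '""') + '"' for name in header)
        conn.execute(f'CREATE TABLE "{table}" ({cols})')
        placeholders = ", ".join("?" for _ in header)
        conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', data)
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


# Made-up sample data; every value is loaded as text, hence the CAST below.
SAMPLE = "name,age\nada,36\nlin,29\nsam,41\n"
result = query_csv(
    SAMPLE,
    "SELECT name FROM t WHERE CAST(age AS INTEGER) > 30 ORDER BY name",
)
print(result)  # [('ada',), ('sam',)]
```

Because the heavy lifting happens inside SQLite, a tool built this way inherits SQLite's SQL dialect and behavior, which is the trade-off the comment contrasts with octosql's custom engine.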

textql

Mentions: 15 | Stars: 9,031 | Activity: 3.7 | Language: Go

Execute SQL against structured text like CSV or TSV

dsq

Mentions: 20 | Stars: 3,619 | Activity: 4.3 | Language: Go

Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.

Binaries are now available! https://github.com/multiprocessio/dsq#macos-linux

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
SQL Python GraphQL Arrow CLI
Post date: 11 Jan 2022