tsv-utils VS dsq

Compare tsv-utils vs dsq and see what are their differences.

tsv-utils

eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more. (by eBay)

dsq

Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more. (by multiprocessio)
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
tsv-utils dsq
10 20
1,446 3,836
0.4% 1.1%
0.0 4.3
almost 3 years ago almost 2 years ago
D Go
gtkbook License GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

tsv-utils

Posts with mentions or reviews of tsv-utils. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-21.

dsq

Posts with mentions or reviews of dsq. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-02.
  • Tracking SQLite Database Changes in Git
    7 projects | news.ycombinator.com | 2 Nov 2023
    You might want to look at tsv-utils, or a similar project: https://github.com/eBay/tsv-utils

    For the SQL part, but maybe a lot heavier, you can use one of the projects listed on this page: https://github.com/multiprocessio/dsq (No longer maintained, but has links to lots of other projects)

  • DuckDB: Querying JSON files as if they were tables
    9 projects | news.ycombinator.com | 3 Mar 2023
    Welcome to the gang! :)

    https://github.com/multiprocessio/dsq#comparisons

  • Ask HN: Programs that saved you 100 hours? (2022 edition)
    69 projects | news.ycombinator.com | 20 Dec 2022
  • Command-line data analytics made easy
    6 projects | news.ycombinator.com | 3 Nov 2022
    SPyQL is really cool and its design is very smart, with it being able to leverage normal Python functions!

    As far as similar tools go, I recommend taking a look at DataFusion[0], dsq[1], and OctoSQL[2].

    DataFusion is a very (very very) fast command-line SQL engine but with limited support for data formats.

    dsq is based on SQLite which means it has to load data into SQLite first, but then gives you the whole breath of SQLite, it also supports many data formats, but is slower at the same time.

    OctoSQL is faster, extensible through plugins, and supports incremental query execution, so you can i.e. calculate a running group by + count while tailing a log file. It also supports normal databases, not just file formats, so you can i.e. join with a Postgres table.

    [0]: https://github.com/apache/arrow-datafusion

    [1]: https://github.com/multiprocessio/dsq

    [2]: https://github.com/cube2222/octosql

    Disclaimer: Author of OctoSQL

  • Jq Internals: Backtracking
    10 projects | news.ycombinator.com | 5 Oct 2022
    > dsq registers go-sqlite3-stdlib so you get access to numerous statistics, url, math, string, and regexp functions that aren't part of the SQLite base. (https://github.com/multiprocessio/dsq#standard-library)

    Ah, I wondered if they rolled their own SQL parser, but no, I now see the sqlite.go in the repo and all is made clear

  • Run SQL on CSV, Parquet, JSON, Arrow, Unix Pipes and Google Sheet
    9 projects | news.ycombinator.com | 24 Sep 2022
    I am currently evaluating dsq and its partner desktop app DataStation. AIUI, the developer of DataStation realised that it would be useful to extract the underlying pieces into a standalone CLI, so they both support the same range of sources.

    dsq CLI - https://github.com/multiprocessio/dsq

  • multiprocessio / dsq :
    1 project | /r/golang | 1 Sep 2022
  • OctoSQL allows you to join data from different sources using SQL
    13 projects | news.ycombinator.com | 14 Jul 2022
  • GitHub Actions are down again
    2 projects | news.ycombinator.com | 29 Jun 2022
    What's annoying about this is that the PR doesn't even say it's trying to run tests. It says everything is passing and just doesn't list the actions.

    For a second I thought someone must have deleted the actions yaml files.

    This is a dangerous failure mode.

    https://github.com/multiprocessio/dsq/pull/82

  • Xlite: Query Excel, Open Document spreadsheets (.ods) as SQLite virtual tables
    6 projects | news.ycombinator.com | 25 Jun 2022
    This is a cool project! But if you query Excel and ODS files with dsq you get the same thing plus a growing standard library of functions that don't come built into SQLite such as best-effort date parsing, URL parsing/extraction, statistical aggregation functions, math functions, string and regex helpers, hashing functions and so on [1].

    [0] https://github.com/multiprocessio/dsq

    [1] https://github.com/multiprocessio/go-sqlite3-stdlib

What are some alternatives?

When comparing tsv-utils and dsq you can also consider the following projects:

zsv - zsv+lib: tabular data swiss-army knife CLI + world's fastest (simd) CSV parser

go-duckdb - go-duckdb provides a database/sql driver for the DuckDB database engine.

goawk - A POSIX-compliant AWK interpreter written in Go, with CSV support

jless - jless is a command-line JSON viewer designed for reading, exploring, and searching through JSON data.

dextool - Suite of C/C++ tooling built on LLVM/Clang

textql - Execute SQL against structured text like CSV or TSV

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured