datastation VS dsq

Compare datastation vs dsq and see what are their differences.

dsq

Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more. (by multiprocessio)
Our great sponsors
  • SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
datastation dsq
25 20
2,854 3,634
0.4% 4.8%
0.0 4.3
6 months ago 7 months ago
TypeScript Go
GNU General Public License v3.0 or later GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

datastation

Posts with mentions or reviews of datastation. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-08.
  • Code coverage for Go integration tests
    6 projects | news.ycombinator.com | 8 Mar 2023
    There was a technique that existed already where you could use `go test -cover` and the `-o` flag to produce a binary from `go test` rather than actually running tests. So you could build a binary that had coverage enabled. Then when you ran

    Here's an example: https://github.com/multiprocessio/datastation/blob/main/runn....

    I can't remember where I found this technique but it's been around for a while.

    This new option is the same thing but a way to `go build` with `-cover` instead of `go test -cover -o $out`? Do I have that right?

  • Engineers using dbt with VS Code - how are you previewing your results in lieu of the functionality provided by dbt cloud?
    2 projects | /r/dataengineering | 29 Jun 2022
    If my employer doesn't consider paying for dbt cloud, I will use u/eatonphil 's datastation, run the queries on a dev database then put them in dbt.
  • Show HN: DataStation – App to easily query, script, and visualize data
    4 projects | news.ycombinator.com | 31 May 2022
  • Windmill.dev
    6 projects | news.ycombinator.com | 5 May 2022
    I build a somewhat similar app, DataStation [0], that is in JavaScript and Go. It supports scripting in Python, Julia, R, JavaScript, Ruby, etc.

    The server version of it exists and I run it myself but that process is not documented yet. (Most people use it as a desktop app today.)

    [0] https://github.com/multiprocessio/datastation

  • Datasette Lite: a server-side Python web application running in a browser
    5 projects | news.ycombinator.com | 4 May 2022
    My biggest issue with Pyodide is the long wait times. I haven't figured out a way around a ~5 second load time where the entire UI hangs every single time you load the page.

    My app (similar to Simon's, a lite mode of a data IDE): https://app.datastation.multiprocess.io.

    My code: https://github.com/multiprocessio/datastation/blob/main/shar....

  • Lies we tell ourselves to keep using Golang
    7 projects | news.ycombinator.com | 29 Apr 2022
    I use Go heavily cross-platform developing DataStation [0] and dsq [1]. I am not an expert. And I don't have proof for it but on some rudimentary benchmarks the Linux-specific file idioms in the Go standard library definitely don't seem to translate well to even macOS let alone Windows. For example some good streaming techniques for reading large files on Linux that work really well there seemed to be pretty bad on macOS.

    I think Amos has presented more proof than I can on the topic of just how Linux-influenced Go is. And I think it is fine for the majority of Go users because the majority users of Go are building server apps or Linux CLIs.

    Amos has spent some time building cross-platform desktop systems with Go for itch.io and I think I'm seeing some of the same things they are in that scenario.

    I think this is a reasonable article. If Amos gets flame-y at any point I think it's worth ignoring because there does seem to be something up with Go in cross-platform applications.

    I like Go a lot and for most things I'd keep using it still. Just sharing some observations.

    [0] https://github.com/multiprocessio/datastation

    [1] https://github.com/multiprocessio/dsq

  • Feeling overwhelmed when trying to contribute to opensource projects
    2 projects | /r/software | 6 Apr 2022
    I keep a page of good first projects for two big projects I work on. The only expectation is that you know Go. I've had a couple of people who've never contributed to OSS come in and get some meaningful features merged.
  • Ask HN: Who wants to collaborate? (April 2022)
    21 projects | news.ycombinator.com | 1 Apr 2022
    I've got some good first projects if you're interested in OSS data tools and have some Go experience.

    Check out: https://github.com/multiprocessio/datastation/blob/main/GOOD...

  • Open source Go projects to contribute (beginners)
    25 projects | /r/golang | 5 Mar 2022
    Some example projects: DataStation (desktop GUI for querying every kind of database, scripting and graphing the results) and dsq (a CLI companion for running SQL queries on many kinds of files), and go-json (a library for fast JSON encoding of arrays of large objects).
  • Ask HN: Anyone making a living building desktop applications?
    13 projects | news.ycombinator.com | 21 Jan 2022
    I'm building a desktop-first (SaaS-eventual) data IDE for developers [0]. Making a living? Not yet.

    It being desktop-first makes it as easy to try out in a corporate environment as Sublime. The data never leaves your machine. Desktop-first is a big deal in devtools for this reason.

    [0] https://github.com/multiprocessio/datastation

dsq

Posts with mentions or reviews of dsq. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-02.
  • Tracking SQLite Database Changes in Git
    7 projects | news.ycombinator.com | 2 Nov 2023
    You might want to look at tsv-utils, or a similar project: https://github.com/eBay/tsv-utils

    For the SQL part, but maybe a lot heavier, you can use one of the projects listed on this page: https://github.com/multiprocessio/dsq (No longer maintained, but has links to lots of other projects)

  • DuckDB: Querying JSON files as if they were tables
    9 projects | news.ycombinator.com | 3 Mar 2023
    Welcome to the gang! :)

    https://github.com/multiprocessio/dsq#comparisons

  • Ask HN: Programs that saved you 100 hours? (2022 edition)
    69 projects | news.ycombinator.com | 20 Dec 2022
  • Command-line data analytics made easy
    6 projects | news.ycombinator.com | 3 Nov 2022
    SPyQL is really cool and its design is very smart, with it being able to leverage normal Python functions!

    As far as similar tools go, I recommend taking a look at DataFusion[0], dsq[1], and OctoSQL[2].

    DataFusion is a very (very very) fast command-line SQL engine but with limited support for data formats.

    dsq is based on SQLite which means it has to load data into SQLite first, but then gives you the whole breath of SQLite, it also supports many data formats, but is slower at the same time.

    OctoSQL is faster, extensible through plugins, and supports incremental query execution, so you can i.e. calculate a running group by + count while tailing a log file. It also supports normal databases, not just file formats, so you can i.e. join with a Postgres table.

    [0]: https://github.com/apache/arrow-datafusion

    [1]: https://github.com/multiprocessio/dsq

    [2]: https://github.com/cube2222/octosql

    Disclaimer: Author of OctoSQL

  • Jq Internals: Backtracking
    10 projects | news.ycombinator.com | 5 Oct 2022
    > dsq registers go-sqlite3-stdlib so you get access to numerous statistics, url, math, string, and regexp functions that aren't part of the SQLite base. (https://github.com/multiprocessio/dsq#standard-library)

    Ah, I wondered if they rolled their own SQL parser, but no, I now see the sqlite.go in the repo and all is made clear

  • Run SQL on CSV, Parquet, JSON, Arrow, Unix Pipes and Google Sheet
    9 projects | news.ycombinator.com | 24 Sep 2022
    I am currently evaluating dsq and its partner desktop app DataStation. AIUI, the developer of DataStation realised that it would be useful to extract the underlying pieces into a standalone CLI, so they both support the same range of sources.

    dsq CLI - https://github.com/multiprocessio/dsq

  • multiprocessio / dsq :
    1 project | /r/golang | 1 Sep 2022
  • OctoSQL allows you to join data from different sources using SQL
    13 projects | news.ycombinator.com | 14 Jul 2022
    OctoSQL is an awesome project and Kuba has a lot of great experience to share from building this project I'm excited to learn from.

    And while building a custom database engine does allow you to do pretty quick queries, there are a few issues.

    First, the SQL implemented is nonstandard. As I was looking for documentation and it pointed me to `SELECT * FROM docs.functions fs`. I tried to count the number of functions but octosql crashed (a Go panic) when I ran `SELECT count(1) FROM docs.functions fs` and `SELECT count() FROM docs.functions fs` which is what I lazily do in standard SQL databases. (`SELECT count(fs.name) FROM docs.function fs` worked.)

    This kind of thing will keep happening because this project just doesn't have as much resources today as SQLite, Postgres, DuckDB, etc. It will support a limited subset of SQL.

    Second, the standard library seems pretty small. When I counted the builtin functions there were only 29. Now this is an easy thing to rectify over time but just noting about the state today.

    And third this project only has builtin support for querying CSV and JSON files. Again this could be easy to rectify over time but just mentioning the state today.

    octosql is a great project but there are also different ways to do the same thing.

    I build dsq [0] which runs all queries through SQLite so it avoids point 1. It has access to SQLite's standard builtin functions plus* a battery of extra statistic aggregation, string manipulation, url manipulation, date manipulation, hashing, and math functions custom built to help this kind of interactive querying developers commonly do [1].

    And dsq supports not just CSV and JSON but parquet, excel, ODS, ORC, YAML, TSV, and Apache and nginx logs.

    A downside to dsq is that it is slower for large files (say over 10GB) when you only want a few columns whereas octosql does better in some of those cases. I'm hoping to improve this over time by adding a SQL filtering frontend to dsq but in all cases dsq will ultimately use SQLite as the query engine.

    You can find more info about similar projects in octosql's Benchmark section but I also have a comparison section in dsq [2] and an extension of the octosql benchmark with different set of tools [3] including duckdb.

    Everyone should check out duckdb. :)

    [0] https://github.com/multiprocessio/dsq

    [1] https://github.com/multiprocessio/go-sqlite3-stdlib

    [2] https://github.com/multiprocessio/dsq#comparisons

    [3] https://github.com/multiprocessio/dsq#benchmark

  • GitHub Actions are down again
    2 projects | news.ycombinator.com | 29 Jun 2022
    What's annoying about this is that the PR doesn't even say it's trying to run tests. It says everything is passing and just doesn't list the actions.

    For a second I thought someone must have deleted the actions yaml files.

    This is a dangerous failure mode.

    https://github.com/multiprocessio/dsq/pull/82

  • Xlite: Query Excel, Open Document spreadsheets (.ods) as SQLite virtual tables
    6 projects | news.ycombinator.com | 25 Jun 2022
    This is a cool project! But if you query Excel and ODS files with dsq you get the same thing plus a growing standard library of functions that don't come built into SQLite such as best-effort date parsing, URL parsing/extraction, statistical aggregation functions, math functions, string and regex helpers, hashing functions and so on [1].

    [0] https://github.com/multiprocessio/dsq

    [1] https://github.com/multiprocessio/go-sqlite3-stdlib

What are some alternatives?

When comparing datastation and dsq you can also consider the following projects:

homebrew-emacs-plus - Emacs Plus formulae for the Homebrew package manager

go-duckdb - go-duckdb provides a database/sql driver for the DuckDB database engine.

gecko-dev - Read-only Git mirror of the Mercurial gecko repositories at https://hg.mozilla.org. How to contribute: https://firefox-source-docs.mozilla.org/contributing/contribution_quickref.html

q - q - Run SQL directly on delimited files and multi-file sqlite databases

vscode-jupyter - VS Code Jupyter extension

querycsv - QueryCSV enables you to load CSV files and manipulate them using SQL queries then after you finish you can export the new values to a CSV file

golang-samples - Sample apps and code written for Google Cloud in the Go programming language.

octosql - OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

datasette - An open source multi-tool for exploring and publishing data

xlite - Query Excel spredsheets (.xlsx, .xls, .ods) using SQLite

oursh - Your comrade through the perilous world of UNIX.

textql - Execute SQL against structured text like CSV or TSV