xsv vs goawk

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

xsv		goawk
	Project
64	Mentions	19
10,089	Stars	1,888
-	Growth	-
0.0	Activity	7.2
2 months ago	Latest Commit	10 days ago
Rust	Language	Go
The Unlicense	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

xsv

Posts with mentions or reviews of xsv. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-02.

Show HN: TextQuery – Query and Visualize Your CSV Data in Minutes
3 projects | news.ycombinator.com | 2 Apr 2024

I realize it's not really that comparable since these tools don't support SQL, but a more fully functioned CLI tool is - https://github.com/BurntSushi/xsv
They are both fairly good
Qsv: Efficient CSV CLI Toolkit
8 projects | news.ycombinator.com | 22 Dec 2023
Joining CSV Data Without SQL: An IP Geolocation Use Case
3 projects | news.ycombinator.com | 19 Oct 2023

I have done some similar, simpler data wrangling with xsv (https://github.com/BurntSushi/xsv) and jq. It could process my 800M rows in a couple of minutes (plus the time to read it out from the database =)
Qsv: CSVs sliced, diced and analyzed (fork of xsv)
2 projects | news.ycombinator.com | 27 Jun 2023

xsv, which seems to be why qsv was created.
[1] https://github.com/BurntSushi/xsv/issues/267
I wrote this iCalendar (.ics) command-line utility to turn common calendar exports into more broadly compatible CSV files.
6 projects | /r/commandline | 24 Mar 2023

CSV utilities (still haven't pick a favorite one...): https://github.com/harelba/q https://github.com/BurntSushi/xsv https://github.com/wireservice/csvkit https://github.com/johnkerl/miller
Icsp – Command-line iCalendar (.ics) to CSV parser
3 projects | news.ycombinator.com | 24 Mar 2023
ripgrep is faster than {grep, ag, git grep, ucg, pt, sift}
8 projects | /r/programming | 24 Mar 2023

$ git remote -v origin [email protected]:rust-lang/rust (fetch) origin [email protected]:rust-lang/rust (push) $ git rev-parse HEAD 3b0d4813ab461ec81eab8980bb884691c97c5a35 $ time grep -ri burntsushi ./ ./src/tools/cargotest/main.rs: repo: "https://github.com/BurntSushi/ripgrep", ./src/tools/cargotest/main.rs: repo: "https://github.com/BurntSushi/xsv", grep: ./target/debug/incremental/cargotest-2dvu4f2km9e91/s-gactj3ma2j-1b10l4z-2l60ur55ixe6n/query-cache.bin: binary file matches grep: ./target/debug/incremental/cargotest-38cpmhhbdgdyq/s-gactj3luwq-1o12vgp-t61hd8qdyp7t/query-cache.bin: binary file matches grep: ./target/debug/incremental/cargotest-17632op6djxne/s-gawuq5468i-1h69nfw-4gm0s8yhhiun/query-cache.bin: binary file matches grep: ./target/debug/incremental/cargotest-2trm4kt5yom3r/s-gawuq53qqg-bjiezj-lo0gha8ign8w/query-cache.bin: binary file matches grep: ./target/debug/deps/libregex_automata-c74a6d9fd0abd77b.rmeta: binary file matches grep: ./target/debug/deps/libsame_file-a0e0363a2985455d.rlib: binary file matches grep: ./target/debug/deps/libsame_file-a0e0363a2985455d.rmeta: binary file matches grep: ./target/debug/deps/libsame_file-7251d8d3586a319b.rmeta: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-sysroot/lib/rustlib/x86_64-unknown-linux-gnu/lib/libaho_corasick-999a08e2b700420d.rlib: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-sysroot/lib/rustlib/x86_64-unknown-linux-gnu/lib/libregex_automata-0d168be5d25b3ac5.rlib: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-tools/x86_64-unknown-linux-gnu/release/deps/libregex_automata-7d6bec0156f15da1.rlib: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-tools/x86_64-unknown-linux-gnu/release/deps/libregex_automata-7d6bec0156f15da1.rmeta: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-tools/x86_64-unknown-linux-gnu/release/deps/libaho_corasick-07dee4514b87d99b.rmeta: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-tools/x86_64-unknown-linux-gnu/release/deps/libaho_corasick-07dee4514b87d99b.rlib: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-rustc/x86_64-unknown-linux-gnu/release/deps/libaho_corasick-999a08e2b700420d.rlib: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-rustc/x86_64-unknown-linux-gnu/release/deps/libaho_corasick-999a08e2b700420d.rmeta: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-rustc/x86_64-unknown-linux-gnu/release/deps/libregex_automata-0d168be5d25b3ac5.rlib: binary file matches grep: ./build/x86_64-unknown-linux-gnu/stage0-rustc/x86_64-unknown-linux-gnu/release/deps/libregex_automata-0d168be5d25b3ac5.rmeta: binary file matches grep: ./build/bootstrap/debug/deps/libaho_corasick-992e1ba08ef83436.rmeta: binary file matches grep: ./build/bootstrap/debug/deps/libignore-54d41239d2761852.rmeta: binary file matches grep: ./build/bootstrap/debug/deps/libsame_file-9a5e3ddd89cfe599.rlib: binary file matches grep: ./build/bootstrap/debug/deps/libregex_automata-8e700951c9869a66.rlib: binary file matches grep: ./build/bootstrap/debug/deps/libignore-54d41239d2761852.rlib: binary file matches grep: ./build/bootstrap/debug/deps/libaho_corasick-992e1ba08ef83436.rlib: binary file matches grep: ./build/bootstrap/debug/deps/libregex_automata-8e700951c9869a66.rmeta: binary file matches grep: ./build/bootstrap/debug/deps/libsame_file-9a5e3ddd89cfe599.rmeta: binary file matches real 16.683 user 15.793 sys 0.878 maxmem 8 MB faults 0
Any Linux admins willing to try Pygrep?
6 projects | /r/linuxadmin | 18 Mar 2023

Unrelated, are you the same burntsushi that wrote xsv?
Analyzing multi-gigabyte JSON files locally
14 projects | news.ycombinator.com | 18 Mar 2023

If it could be tabular in nature, maybe convert to sqlite3 so you can make use of indexing, or CSV to make use of high-performance tools like xsv or zsv (the latter of which I'm an author).
https://github.com/BurntSushi/xsv
https://github.com/liquidaty/zsv/blob/main/docs/csv_json_sql...
What monitoring tool do you use or recommend?
5 projects | /r/selfhosted | 6 Mar 2023

Oh and there's rad cli shit out there for CSV files too, like xsv

goawk

Posts with mentions or reviews of goawk. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-29.

GoAWK, an Awk interpreter written in Go (2018)
1 project | news.ycombinator.com | 3 Feb 2024
The Awk Programming Language, Second Edition
18 projects | news.ycombinator.com | 29 Jun 2023

TIL: GoAWK [1] - A POSIX-compliant AWK interpreter written in Go, with CSV support.
[1]: https://github.com/benhoyt/goawk
Looking for a script for csv file
1 project | /r/awk | 20 Mar 2023
Anyone else doing compiler work in Golang?
10 projects | /r/golang | 28 Feb 2023

Another nice project that I have used from time to time (and a very good source for insight) is the awk interpreter written in go https://github.com/benhoyt/goawk
Tool to interact with CSV
9 projects | /r/commandline | 27 Feb 2023

No, I want exactly the opposite - it should be a , b,c as a single string field containing a literal comma, and c. For example, https://github.com/benhoyt/goawk has csv support. https://github.com/benhoyt/goawk/blob/master/docs/csv.md - more info.
Why does awk parse '1&&x=1' as '1&&(x=1)' not '(1&&x)=1' when '&&' is high precedence than '='?
1 project | /r/ProgrammingLanguages | 11 Feb 2023

I've had a go at solving this in this PR -- feedback welcome. I don't love it, but oh well, it solves the problem at hand. Your comment pointed me in the right direction, thanks again.
Looking for programming languages created with Go
23 projects | /r/golang | 6 Nov 2022

There are quite a few re-implementations of scripting languages like Lua in Go. I've written an AWK interpreter in Go.
Oracle DB support in Benthos
8 projects | /r/golang | 7 Oct 2022

github.com/benhoyt/goawk -> this library lets you embed an AWK runtime in your applications, very easy to use and useful for enabling some powerful scripting in things you build
Brian Kernighan adds Unicode support to Awk (May, 2022)
13 projects | news.ycombinator.com | 20 Aug 2022

Yes, that's right. With my simplistic UTF-8-based implementation it turned length() -- for example -- from O(1) to O(N), turning O(N) algorithms which use length() into O(N^2). See this issue: https://github.com/benhoyt/goawk/issues/93
Similar with substr() and other string functions, which when operating as bytes are O(1), but become O(N) when trying to count the number of codepoints as UTF-8.
GNU Gawk has a fancier approach, which stores strings as UTF-8 as long as it can, but converts to UTF-32 if it needs to (eg: the string is non-ASCII and you call substr).
It looks like Brian Kernighan's code has the same issue with length() and substr(). I'm going to try to email him about this, as I think it's kind of a performance blocker.
Ask HN: Is having a Personal blog/brand worth it for you?
7 projects | news.ycombinator.com | 18 Jul 2022

I'm not sure if it was via my personal website or just my GitHub profile, but I got my current job at Canonical due to the CTO there reaching out about my GoAWK project (https://github.com/benhoyt/goawk). I get regular recruitment emails because I have my CV/resume online: most of them are very low-effort, but 1 in 20 or something are interesting emails where the recruiter has actually looked at my website and will tailor it personally. I also just enjoy technical writing, and get joy out of sharing it on HN. So it's "worth it" for me.

What are some alternatives?

When comparing xsv and goawk you can also consider the following projects:

csvtk - A cross-platform, efficient and practical CSV/TSV toolkit in Golang

bytehound - A memory profiler for Linux.

miller - Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

tsv-utils - eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.

ripgrep - ripgrep recursively searches directories for a regex pattern while respecting your gitignore

awka - Revive awka - Awk to C Compiler

Servo - Servo, the embeddable, independent, memory-safe, modular, parallel web rendering engine

intellij-awk - The missing IntelliJ IDEA language support plugin for AWK

Fractalide - Reusable Reproducible Composable Software

tumblelog - A static tumblelog generator available as both a Perl and Python version

svgcleaner - svgcleaner could help you to clean up your SVG files from the unnecessary data.

awk - One true awk

xsv vs csvtk goawk vs bytehound xsv vs miller goawk vs tsv-utils xsv vs ripgrep goawk vs awka xsv vs Servo goawk vs intellij-awk xsv vs Fractalide goawk vs tumblelog xsv vs svgcleaner goawk vs awk

Compare xsv vs goawk and see what are their differences.

xsv

goawk

xsv

goawk

What are some alternatives?