goawk vs miller

goawk

A POSIX-compliant AWK interpreter written in Go, with CSV support (by benhoyt)

benhoyt.com

miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON (by johnkerl)

miller.readthedocs.io

Project	goawk	miller
Mentions	19	63
Stars	1,885	8,553
Growth	-	-
Activity	7.1	9.1
Latest Commit	8 days ago	8 days ago
Language	Go	Go
License	MIT License	GNU General Public License v3.0 or later

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub.
Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones. For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.

goawk

Posts with mentions or reviews of goawk. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-29.

GoAWK, an Awk interpreter written in Go (2018)
1 project | news.ycombinator.com | 3 Feb 2024
The Awk Programming Language, Second Edition
18 projects | news.ycombinator.com | 29 Jun 2023

TIL: GoAWK [1] - A POSIX-compliant AWK interpreter written in Go, with CSV support.
[1]: https://github.com/benhoyt/goawk
Looking for a script for csv file
1 project | /r/awk | 20 Mar 2023
Anyone else doing compiler work in Golang?
10 projects | /r/golang | 28 Feb 2023

Another nice project that I have used from time to time (and a very good source for insight) is the awk interpreter written in go https://github.com/benhoyt/goawk
Tool to interact with CSV
9 projects | /r/commandline | 27 Feb 2023

No, I want exactly the opposite - it should be a , b,c as a single string field containing a literal comma, and c. For example, https://github.com/benhoyt/goawk has csv support. https://github.com/benhoyt/goawk/blob/master/docs/csv.md - more info.
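
The post above turns on whether a comma inside a field is treated as data or as a separator. As a rough illustration (not GoAWK's own code), the sketch below uses Go's standard `encoding/csv` package to contrast a naive split on commas with a quote-aware parse. The input line and the helper names `naiveSplit` and `csvSplit` are hypothetical, chosen only for this example.

```go
package main

import (
	"encoding/csv"
	"fmt"
	"strings"
)

// naiveSplit splits a line on every comma and ignores CSV quoting rules.
func naiveSplit(line string) []string {
	return strings.Split(line, ",")
}

// csvSplit parses one CSV record and honors double-quoted fields,
// so a comma inside quotes stays part of the field.
func csvSplit(line string) ([]string, error) {
	r := csv.NewReader(strings.NewReader(line))
	return r.Read()
}

func main() {
	// Hypothetical input: the first field contains a literal comma.
	line := `"a , b",c`

	fmt.Println("naive pieces:", len(naiveSplit(line)))

	fields, err := csvSplit(line)
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Println("csv fields:", len(fields))
	for i, f := range fields {
		fmt.Printf("field %d: %q\n", i, f)
	}
}
```

The naive split cuts the quoted field in two, while the CSV reader returns two fields, the first of which keeps its comma. GoAWK's CSV mode is described in the csv.md document linked in the post.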
Why does awk parse '1&&x=1' as '1&&(x=1)' not '(1&&x)=1' when '&&' is high precedence than '='?
1 project | /r/ProgrammingLanguages | 11 Feb 2023

I've had a go at solving this in this PR -- feedback welcome. I don't love it, but oh well, it solves the problem at hand. Your comment pointed me in the right direction, thanks again.
Looking for programming languages created with Go
23 projects | /r/golang | 6 Nov 2022

There are quite a few re-implementations of scripting languages like Lua in Go. I've written an AWK interpreter in Go.
Oracle DB support in Benthos
8 projects | /r/golang | 7 Oct 2022

github.com/benhoyt/goawk -> this library lets you embed an AWK runtime in your applications, very easy to use and useful for enabling some powerful scripting in things you build
Brian Kernighan adds Unicode support to Awk (May, 2022)
13 projects | news.ycombinator.com | 20 Aug 2022

Yes, that's right. With my simplistic UTF-8-based implementation it turned length() -- for example -- from O(1) to O(N), turning O(N) algorithms which use length() into O(N^2). See this issue: https://github.com/benhoyt/goawk/issues/93
Similar with substr() and other string functions, which when operating as bytes are O(1), but become O(N) when trying to count the number of codepoints as UTF-8.
GNU Gawk has a fancier approach, which stores strings as UTF-8 as long as it can, but converts to UTF-32 if it needs to (eg: the string is non-ASCII and you call substr).
It looks like Brian Kernighan's code has the same issue with length() and substr(). I'm going to try to email him about this, as I think it's kind of a performance blocker.
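
The cost difference described in the post above can be seen with Go's own string functions. This is a minimal sketch, not code from GoAWK: `len` reads the byte length stored with the string, while `utf8.RuneCountInString` has to decode the whole string to count code points. The helper names `byteLength` and `charLength` are hypothetical.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// byteLength returns the number of bytes in s. Go keeps the length
// alongside the string data, so this does not scan the string.
func byteLength(s string) int {
	return len(s)
}

// charLength counts Unicode code points by decoding the UTF-8 bytes,
// which takes time proportional to the length of s.
func charLength(s string) int {
	return utf8.RuneCountInString(s)
}

func main() {
	s := "h\u00e9llo" // "héllo": the é is encoded as two bytes in UTF-8
	fmt.Println(byteLength(s), charLength(s)) // prints "6 5"
}
```

Calling a code-point count inside a loop over the same string is what turns a linear algorithm into a quadratic one, as the post notes.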
Ask HN: Is having a Personal blog/brand worth it for you?
7 projects | news.ycombinator.com | 18 Jul 2022

I'm not sure if it was via my personal website or just my GitHub profile, but I got my current job at Canonical due to the CTO there reaching out about my GoAWK project (https://github.com/benhoyt/goawk). I get regular recruitment emails because I have my CV/resume online: most of them are very low-effort, but 1 in 20 or something are interesting emails where the recruiter has actually looked at my website and will tailor it personally. I also just enjoy technical writing, and get joy out of sharing it on HN. So it's "worth it" for me.

miller

Posts with mentions or reviews of miller. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-22.

Qsv: Efficient CSV CLI Toolkit
8 projects | news.ycombinator.com | 22 Dec 2023
jq 1.7 Released
33 projects | news.ycombinator.com | 6 Sep 2023

jq and miller[1] are essential parts of my toolbelt, right up there with awk and vim.
[1]: https://github.com/johnkerl/miller
Perl first commit: a “replacement” for Awk and sed
3 projects | news.ycombinator.com | 8 Jul 2023

> This works really well if your problem can be solved in one or two liners.
My personal comfort threshold is around the 100-line mark. It's even possible to write maintainable shell scripts up to 500 lines, but it mostly depends on the problem you're trying to solve, and the discipline of the programmer to follow best practices (use sane defaults, ShellCheck, etc.).
> It go bad very quickly when, say, you have two CSV files and want to join them the sql-way.
In that case we're talking about structured data, and, yeah, Perl or Python would be easier to work with. That said, depending on the complexity of the CSV, you can still go a long way with plain Bash with IFS/read(1) or tr(1) to split CSV columns. This wouldn't be very robust, but there are tools that handle CSV specifically[1], which can be composed in a shell script just fine.
So it's always a balancing act of being productive quickly with a shell script, or reaching out for a programming language once the tools aren't a good fit, or maintenance becomes an issue.
[1]: https://miller.readthedocs.io/
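
To make the "join two CSV files the SQL way" case from the post above concrete, here is a small sketch of an inner join on the first column, written with Go's standard `encoding/csv` package. It is an illustration under simple assumptions, not how Miller implements its `join` verb: the function name `joinOnFirstColumn` and the sample data are hypothetical, keys on the right side are assumed unique, and header rows are treated as ordinary rows (they join because both files name the key column `id`).

```go
package main

import (
	"encoding/csv"
	"fmt"
	"strings"
)

// joinOnFirstColumn performs an inner join of two CSV inputs on their
// first column. Right-side keys are assumed to be unique.
func joinOnFirstColumn(left, right string) ([][]string, error) {
	leftRows, err := csv.NewReader(strings.NewReader(left)).ReadAll()
	if err != nil {
		return nil, err
	}
	rightRows, err := csv.NewReader(strings.NewReader(right)).ReadAll()
	if err != nil {
		return nil, err
	}

	// Index the right side by key so each left row needs one lookup.
	index := make(map[string][]string, len(rightRows))
	for _, row := range rightRows {
		index[row[0]] = row[1:]
	}

	var out [][]string
	for _, row := range leftRows {
		if rest, ok := index[row[0]]; ok {
			// Copy the left row, then append the matching right columns.
			joined := append(append([]string{}, row...), rest...)
			out = append(out, joined)
		}
	}
	return out, nil
}

func main() {
	left := "id,name\n1,alice\n2,bob\n3,carol\n"
	right := "id,city\n1,Oslo\n3,Lima\n"

	rows, err := joinOnFirstColumn(left, right)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	for _, row := range rows {
		fmt.Println(strings.Join(row, ","))
	}
	// Prints:
	// id,name,city
	// 1,alice,Oslo
	// 3,carol,Lima
}
```

Because the parsing goes through a CSV reader rather than splitting on commas, quoted fields survive the join, which is the robustness gap the post mentions for IFS/read(1) and tr(1).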
Need help on cleaning this data!!
1 project | /r/datacleaning | 13 Jun 2023

where mlr is from https://github.com/johnkerl/miller
Running weekly average
1 project | /r/bash | 10 Jun 2023

if this class of problems (i.e., csv/tsv data) is your main target you may find miller (https://github.com/johnkerl/miller) much more useful in the long run
GQL: A new SQL like query language for .git files written in Rust
2 projects | /r/programming | 9 Jun 2023

That said, you may be interested in Miller (https://github.com/johnkerl/miller) which provides similar capabilities for CSV, JSON, and XML files. It doesn't use a SQL grammar, but that's just the proverbial lipstick on the thing. I'm not the author, but I have used it and I see some parallels in use cases at the very least.
johnkerl/miller: Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
1 project | /r/devel | 8 Jun 2023
Any cli utility to create ascii/org mode tables?
3 projects | /r/commandline | 12 Apr 2023

worth giving Miller a shot
I wrote this iCalendar (.ics) command-line utility to turn common calendar exports into more broadly compatible CSV files.
6 projects | /r/commandline | 24 Mar 2023

CSV utilities (still haven't picked a favorite one...): https://github.com/harelba/q https://github.com/BurntSushi/xsv https://github.com/wireservice/csvkit https://github.com/johnkerl/miller
Miller: Like Awk, sed, cut, join, and sort for CSV, TSV, and tabular JSON
1 project | /r/hypeurls | 16 Mar 2023

What are some alternatives?

When comparing goawk and miller you can also consider the following projects:

bytehound - A memory profiler for Linux.

visidata - A terminal spreadsheet multitool for discovering and arranging data

tsv-utils - eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.

xsv - A fast CSV command line toolkit written in Rust.

awka - Revive awka - Awk to C Compiler

jq - Command-line JSON processor [Moved to: https://github.com/jqlang/jq]

intellij-awk - The missing IntelliJ IDEA language support plugin for AWK

dasel - Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

tumblelog - A static tumblelog generator available as both a Perl and Python version

csvtk - A cross-platform, efficient and practical CSV/TSV toolkit in Golang

awk - One true awk

yq - yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor
