Awk-Batteries
goawk
Awk-Batteries | goawk | |
---|---|---|
8 | 19 | |
12 | 1,888 | |
- | - | |
1.8 | 7.2 | |
over 2 years ago | 11 days ago | |
Awk | Go | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Awk-Batteries
-
The Awk Programming Language, Second Edition
It's nice that everyone is supporting this, I've written a portable awk module that takes control of the parsing and it is SLOW (and a little buggy). I'm a little bummed that nobody will use it but this is truly a step in the right direction.
I guess for the people that are still using nawk, you can set up an AWK envvar so you can { awk -f $AWKU/ucsv.awk -f <(echo '{print NR, $1}') }
https://github.com/Nomarian/Awk-Batteries/blob/master/Units/...
-
Need help with awk script that keeps giving me syntax errors
if you have gawkextlib, you can -i csv, you can also download this and put it in your $AWKPATH then you just -i ucsv and use the csv file as normal. if you need an array with headers let me know.
-
Modernizing AWK, a 45-year old language, by adding CSV support
I wrote a parser that does what -F, does but correctly, you can see that its actually very difficult. Not only that, but there are extensions to csv, some csv have a header which means that instead of $1 $2 $3 ..., you just name the field instead, which means you have to ignore the first record. there's also other things, but csv is a difficult format to parse.
- Using AWK with CSV Files
-
Python was listed on "Harmful things", but why though ? http://harmful.cat-v.org/software
My attempt in awk
- Understanding AWK
- Understanding Awk
-
Getting better at Linux with mini-projects
awk lua
goawk
- GoAWK, an Awk interpreter written in Go (2018)
-
The Awk Programming Language, Second Edition
TIL: GoAWK [1] - A POSIX-compliant AWK interpreter written in Go, with CSV support.
[1]: https://github.com/benhoyt/goawk
- Looking for a script for csv file
-
Anyone else doing compiler work in Golang?
Another nice project that I have used from time to time (and a very good source for insight) is the awk interpreter written in go https://github.com/benhoyt/goawk
-
Tool to interact with CSV
No, I want exactly the opposite - it should be a , b,c as a single string field containing a literal comma, and c. For example, https://github.com/benhoyt/goawk has csv support. https://github.com/benhoyt/goawk/blob/master/docs/csv.md - more info.
-
Why does awk parse '1&&x=1' as '1&&(x=1)' not '(1&&x)=1' when '&&' is high precedence than '='?
I've had a go at solving this in this PR -- feedback welcome. I don't love it, but oh well, it solves the problem at hand. Your comment pointed me in the right direction, thanks again.
-
Looking for programming languages created with Go
There are quite a few re-implementations of scripting languages like Lua in Go. I've written an AWK interpreter in Go.
-
Oracle DB support in Benthos
github.com/benhoyt/goawk -> this library lets you embed an AWK runtime in your applications, very easy to use and useful for enabling some powerful scripting in things you build
-
Brian Kernighan adds Unicode support to Awk (May, 2022)
Yes, that's right. With my simplistic UTF-8-based implementation it turned length() -- for example -- from O(1) to O(N), turning O(N) algorithms which use length() into O(N^2). See this issue: https://github.com/benhoyt/goawk/issues/93
Similar with substr() and other string functions, which when operating as bytes are O(1), but become O(N) when trying to count the number of codepoints as UTF-8.
GNU Gawk has a fancier approach, which stores strings as UTF-8 as long as it can, but converts to UTF-32 if it needs to (eg: the string is non-ASCII and you call substr).
It looks like Brian Kernighan's code has the same issue with length() and substr(). I'm going to try to email him about this, as I think it's kind of a performance blocker.
-
Ask HN: Is having a Personal blog/brand worth it for you?
I'm not sure if it was via my personal website or just my GitHub profile, but I got my current job at Canonical due to the CTO there reaching out about my GoAWK project (https://github.com/benhoyt/goawk). I get regular recruitment emails because I have my CV/resume online: most of them are very low-effort, but 1 in 20 or something are interesting emails where the recruiter has actually looked at my website and will tailor it personally. I also just enjoy technical writing, and get joy out of sharing it on HN. So it's "worth it" for me.
What are some alternatives?
microperl-standalone
bytehound - A memory profiler for Linux.
sparklines - Text-based sparklines for the command line mimicking those of Edward Tufte.
tsv-utils - eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
csvquote - Enables common unix utlities like cut, awk, wc, head to work correctly with csv data containing delimiters and newlines
awka - Revive awka - Awk to C Compiler
awk - Random AWK code
intellij-awk - The missing IntelliJ IDEA language support plugin for AWK
maga-csv - GAWK CSV extension
tumblelog - A static tumblelog generator available as both a Perl and Python version
busybox-w32 - WIN32 native port of BusyBox.
awk - One true awk