miller
scc
Our great sponsors
miller | scc | |
---|---|---|
63 | 18 | |
8,542 | 5,918 | |
- | - | |
9.1 | 8.2 | |
7 days ago | 8 days ago | |
Go | Go | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
miller
- Qsv: Efficient CSV CLI Toolkit
-
jq 1.7 Released
jq and miller[1] are essential parts of my toolbelt, right up there with awk and vim.
-
Perl first commit: a “replacement” for Awk and sed
> This works really well if your problem can be solved in one or two liners.
My personal comfort threshold is around the 100-line mark. It's even possible to write maintainable shell scripts up to 500 lines, but it mostly depends on the problem you're trying to solve, and the discipline of the programmer to follow best practices (use sane defaults, ShellCheck, etc.).
> It go bad very quickly when, say, you have two CSV files and want to join them the sql-way.
In that case we're talking about structured data, and, yeah, Perl or Python would be easier to work with. That said, depending on the complexity of the CSV, you can still go a long way with plain Bash with IFS/read(1) or tr(1) to split CSV columns. This wouldn't be very robust, but there are tools that handle CSV specifically[1], which can be composed in a shell script just fine.
So it's always a balancing act of being productive quickly with a shell script, or reaching out for a programming language once the tools aren't a good fit, or maintenance becomes an issue.
-
Need help on cleaning this data!!
where mlr is from https://github.com/johnkerl/miller
-
Running weekly average
if this class of problems (i.e., csv/tsv data) is your main target you may find miller (https://github.com/johnkerl/miller) much more useful in the long run
-
GQL: A new SQL like query language for .git files written in Rust
That said, you may be interested in Miller (https://github.com/johnkerl/miller) which provides similar capabilities for CSV, JSON, and XML files. It doesn't use a SQL grammar, but that's just the proverbial lipstick on the thing. I'm not the author, but I have used it and I see some parallels in use cases at the very least.
- johnkerl/miller: Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
-
Any cli utility to create ascii/org mode tables?
worth giving Miller a shot
-
I wrote this iCalendar (.ics) command-line utility to turn common calendar exports into more broadly compatible CSV files.
CSV utilities (still haven't pick a favorite one...): https://github.com/harelba/q https://github.com/BurntSushi/xsv https://github.com/wireservice/csvkit https://github.com/johnkerl/miller
- Miller: Like Awk, sed, cut, join, and sort for CSV, TSV, and tabular JSON
scc
-
Essential Command Line Tools for Developers
View on GitHub
-
Ask HN: Programs that saved you 100 hours? (2022 edition)
Going to say my own https://github.com/boyter/scc/ which I have used to turn down projects of "Oh we just need to do X"
It allows me to evaluate the code-base quickly and see where potential issues are, and find hidden complexity in the code. I have said no a lot due to it. The only reason it exists was because I got caught out from another project, which wasted months of my time.
Otherwise IntelliJ and the JetBrains IDE's in general.
-
Building a custom code search index in Go for searchcode.com
Very cool to see this here, Ben! It was fun beating the ins and outs of your work on this in the TZ discord.
Also, off-topic but as you know, I recently tried out your scc tool and am eagerly awaiting its support for Elixir templates (.eex, .heex)!
-
[media] Onefetch v2.13 is typically 2x faster and now supports ~100 programming languages
I believe tokei is the best rust option as of now, but despite my burning passion for rust I've switched to using scc instead as I find it faster and more convenient. Not really an option for you if you're trying to bake line counting into the binary, obviously.
-
Incremental Parsing in Go
I've seen some real world example where Go was as fast or faster than Rust for CPU / io intensive task.
Go is a fast language even with a GC.
- Goal: Pass all 4259065 tests in sqllogictest in 1 week
-
Large project uses Rust backend. My backend developer left. How hard is it for me to learn Rust and take over for him.
I don't trust your qualitative "LARGE" for the project. I would recommend you pass your project through something like a software metrics tool https://github.com/boyter/scc to better measure what you're up against in terms of Flutter/Dart AND Rust code base.
- A fast accurate code counter with complexity calculations and COCOMO estimates
- Fd: A simple, fast and user-friendly alternative to 'find'
-
Introducing next generation of no-code application builder with fullstack codebase export. Offering free application build service and 50% off for limited time.
BTW, for technically inclined, here is scc output for an exported inventory tracker app (you can see actual app build @ https://www.bappaz.com/video or step by step process to build the app @ https://www.bappaz.com/learn )
What are some alternatives?
visidata - A terminal spreadsheet multitool for discovering and arranging data
cloc - cloc counts blank lines, comment lines, and physical lines of source code in many programming languages.
xsv - A fast CSV command line toolkit written in Rust.
tokei-pie - Render tokei's output to interactive sunburst chart.
jq - Command-line JSON processor [Moved to: https://github.com/jqlang/jq]
croc - Easily and securely send things from one computer to another :crocodile: :package:
dasel - Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
Seaweed File System - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. [Moved to: https://github.com/seaweedfs/seaweedfs]
csvtk - A cross-platform, efficient and practical CSV/TSV toolkit in Golang
Gor - GoReplay is an open-source tool for capturing and replaying live HTTP traffic into a test environment in order to continuously test your system with real data. It can be used to increase confidence in code deployments, configuration changes and infrastructure changes.
yq - yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor
Hugo - The world’s fastest framework for building websites.