structured-text-tools vs pup

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

structured-text-tools		pup
	Project
13	Mentions	52
6,870	Stars	8,000
-	Growth	-
8.1	Activity	0.0
29 days ago	Latest Commit	about 1 month ago
	Language	HTML
-	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

structured-text-tools

Posts with mentions or reviews of structured-text-tools. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-11-12.

Command line tools for manipulating structured text data
1 project | news.ycombinator.com | 16 Mar 2023
creating a text file in Linux
1 project | /r/linuxmasterrace | 25 Jan 2023

This works well in scripts and logs of all the commands you need to do to reproduce the current state of the system from a scratch install. Also can be used with diff -u and patch, sed, perl, and awk oneliners and structured text tools. You can also capture most of the commands using sudo logging feature but it won't capture the here documents. But for modest size files you can use newlines in echo commands. Note that commands which use redrection should use something like ~~~~ sudo bash -c "echo 'foo' >>file.txt" ~~~~ instead of "sudo echo foo >>file.txt" or "echo foo | sudo tee -a file.txt
Using Commandline to Process CSV Files
1 project | news.ycombinator.com | 14 Dec 2022

TFA is about how to handle csv files with awk. This might be useful in straightforward cases.
For all others I’d recommend to have a look at
https://github.com/dbohdan/structured-text-tools
which lists tools to handle structure text formats
Combine multiple files
1 project | /r/linuxquestions | 22 Nov 2022

in general, I'd pick something from https://github.com/dbohdan/structured-text-tools
Show HN: Xq – command-line XML and HTML beautifier and content extractor
7 projects | news.ycombinator.com | 12 Nov 2022
structured-text-tools: A list of command line tools for manipulating structured text data
1 project | /r/CKsTechNews | 24 Jun 2022
A list of command line tools for manipulating structured text data
1 project | news.ycombinator.com | 24 Jun 2022

1 project | /r/programming | 7 Sep 2021

2 projects | /r/commandline | 7 Sep 2021
What is your favourite Linux backup software and why?
6 projects | /r/linuxmasterrace | 25 Apr 2022

Also, here is a list of structured text tools. You may find some tools there that are helpful in editing configuration files from the command line. Or you can use "diff -u" to create a patch file (you need to save the patch files along with sudo.log) to recreate. Also, use sfdisk --dump and sfdisk --backup to save partition information in a form that can be used to recreate backups.

pup

Posts with mentions or reviews of pup. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-06.

script to download some notes
1 project | /r/danklinuxusers | 10 Mar 2023

And lnk=$(curl -s https://www.selfstudys.com$url |grep "PDFFlip" | cut -d '"' -f 6) to lnk=$(curl -s https://www.selfstudys.com$url | pup "div#PDFF attr{source}" ) here pup will print content of source attribute from div tag with id PDFF i dont know that much about html & css so this is what i came up with. but i am sure you can also select class & make list of suburls from them. check out the video from bugswriter on pup or read docs from git hub for more info github link: https://github.com/ericchiang/pup
What monitoring tool do you use or recommend?
5 projects | /r/selfhosted | 6 Mar 2023

jq is pretty amazing. If you are comfortable with its jquery-like CSS selector syntax, then I should also mention a couple similar cli utilities that apply it to HTML: htmlp and pup.
Creating a data scraper as a beginner?
1 project | /r/learnprogramming | 2 Mar 2023

Regex is not a great tool for parsing web pages. Open up a browser dev tools window and select a bit of the page. Right click > copy... XPath expression or CSS selector. A proper web scraping tool will accept either of those. No muss, no fuss. You can even use simple command line tools: xpath or pup
December 5, 2022: FLiP Stack Weekly
17 projects | dev.to | 3 Dec 2022
Show HN: A tool like jq, but for parsing HTML
2 projects | news.ycombinator.com | 3 Dec 2022

This is HTML to JSON, written in Rust, and there's also pup[1] which I found out about just the other day on HN[2] which uses a very similar syntax (CSS selectors) but outputs HTML and is written in Go.
I can see room for both though it would interesting to have a more detailed comparison to go on (e.g. types of HTML, speed etc).
[1] https://github.com/ericchiang/pup
[2] https://news.ycombinator.com/item?id=33805732
Pup: Parsing HTML at the command line
1 project | /r/patient_hackernews | 30 Nov 2022

1 project | /r/hackernews | 30 Nov 2022
pup: Parsing HTML at the Command Line
1 project | /r/hypeurls | 30 Nov 2022

7 projects | news.ycombinator.com | 30 Nov 2022

It looks like the project became inactive for a bit and there are alternatives such as htmlq, etc. https://github.com/ericchiang/pup/issues/150
Converting field before delimiter to uppercase and how to replace with multiple newlines
1 project | /r/bash | 18 Nov 2022

Another tool worth mentioning is pup - it can produce JSON output which means you can pipe it to jq

What are some alternatives?

When comparing structured-text-tools and pup you can also consider the following projects:

yq - yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor

htmlq - Like jq, but for HTML.

tsv-utils - eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.

xidel - Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

python-benedict - :blue_book: dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.

gron - Make JSON greppable!

concise-encoding - The secure data format for a modern world

yq - Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

datasette - An open source multi-tool for exploring and publishing data

cascadia - Go cascadia package command line CSS selector

awesome-cli-apps - 🖥 📊 🕹 🛠 A curated list of command line apps

ddgr - :duck: DuckDuckGo from the terminal

structured-text-tools vs yq pup vs htmlq structured-text-tools vs tsv-utils pup vs xidel structured-text-tools vs python-benedict pup vs gron structured-text-tools vs concise-encoding pup vs yq structured-text-tools vs datasette pup vs cascadia structured-text-tools vs awesome-cli-apps pup vs ddgr

Compare structured-text-tools vs pup and see what are their differences.

structured-text-tools

pup

structured-text-tools

pup

What are some alternatives?