dosage VS xidel

Compare dosage vs xidel and see what are their differences.

dosage

dosage is a comic strip downloader and archiver (by mftrhu)

xidel

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents. (by benibela)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
dosage xidel
1 18
0 652
- -
0.0 5.6
almost 5 years ago 27 days ago
Python Pascal
MIT License GNU General Public License v3.0 only
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

dosage

Posts with mentions or reviews of dosage. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-04-01.
  • Search web freely from command line?
    2 projects | /r/commandline | 1 Apr 2021
    What fundamentally is an HTML parser? I read Regex is not sufficient to parse HTML. What basic design feature does an HTML parser have that allows this? I mean, what procedures does it use to recognize content between tags, for example? For one thing, tags in HTML can nest. That's enough to make regular expressions insufficient to deal with it. Now, most regex engines can do more than what regular expressions are supposed to be, but the regex syntax itself is both unfriendly to this sort of thing, and complicated regexes can really slow down the engine. For another thing, the HTML you can find in the wild is exceptionally far from being well-behaved. Leaving aside the mess that has accumulated over the decades, just HTML5, for example, allows for tags to stand on their own. That is, a closing is not required, as it's implicitly added by the parser when encountering the next block-level element. If you are dealing with anything of the sort, a regex-based solution will get exceptionally messy, exceptionally quick. What you can use them for is chunking the input - as a lexer - or extracting snippets of text out of the page. Even then, I'd rather write b.select(".search-results > div > a:nth-of-type(1)") than something like s/]*class="[^"]*search-results[^"]*"[^>]*>/; s/]*>]*href="([^"]+)"[^>]*>([^<]+).*/ (not an exaggeration): the latter is not only much more complex, but also frail. You say bash is by its nature inadequate for this. How come? Does it not have good string handling and processing capabilities, for example? Nope. Bash is first and foremost a shell. It being a general-purpose programming language comes after, and it shows: it doesn't just make simple things complicated, it's also full of pitfalls (e.g., " vs ', how they behave with variable and tilde expansion, splitting of parameters, $@ vs $*...). Even just "bash scripting" is ambiguous as hell, because there are subtle and less-than-subtle differences between bash, sh and zsh, between different versions and implementations of those binaries. I'm no slouch when it comes to shell-wrangling - I like a challenge - but I still shellcheck my work, which doesn't stop me from running into issues when using scripts I wrote on a different computer. If I decided to use a real HTML parser, say one in Python, I want to find a way to pass the output directly, after executing curl, without having to save a file. Pass the output to what? You can just do whatever - print the search results? - from inside Python itself, and you can always write a shell wrapper if you want to do something with the output. I'm currently researching how to access the last output of a bash command, after it's been executed. I don't believe it's saved anywhere. A lot of commands can fill up the scrollback of your terminal, or just never end. Trying to grab the output of the last command would mean trying to grab the output of those commands, which would just pointlessly fill your disc.

xidel

Posts with mentions or reviews of xidel. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-06.

What are some alternatives?

When comparing dosage and xidel you can also consider the following projects:

jq - Command-line JSON processor [Moved to: https://github.com/jqlang/jq]

tools - all-in collection of productivity scripts, CLI tools, utility libraries, fuse filesystems, and also some stuff

pup - Parsing HTML at the command line

gron - Make JSON greppable!

yq - Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

xmltodict - Python module that makes working with XML feel like you are working with JSON

xmlq - filter xml in the command line with xpath

blog.rust-lang.org - Home of the Rust and Inside Rust blogs

JsonPath - Java JsonPath implementation

jsoup - jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.

hn-search - Hacker News Search

jq - Command-line JSON processor