|6 months ago||4 months ago|
|MIT License||MIT License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Ask HN: What are some tools / libraries you built yourself?
news.ycombinator.com | 2021-05-16
I've been working on gazpacho  for last two years.
It's a general purpose web scraping library for Python that replaces BeautifulSoup + requests for most projects.
Just surpassed ~2K downloads every week!
Like JQ, but for HTML
news.ycombinator.com | 2021-09-07
xmlstarlet is really nothing like jq, as a language. But yes, I use it because it is the best commandline xml processor I'd found. That's the only similarity to jq.
Is this the yq? https://kislyuk.github.io/yq/ It does contain an 'xq', as a literal wrapper for jq, piping output into it after transcoding XML to JSON using xmltodict https://github.com/martinblech/xmltodict (which explodes xml into separate JSON data structures).
This is a bash one-liner! But TBF it really is a 'jq for xml'. I think it would be horrible for some things, but you could also do a lot of useful things painlessly.
Parsing unknown XML file with Python?
reddit.com/r/learnpython | 2021-02-06
I used raw data from my watch (and Python) to make a map of all the NH48 hikes from this year. I hiked Liberty and Flume before I got the watch in June, so I need to do those again! Color-coded by altitude.
reddit.com/r/wmnf | 2021-01-05
Super-easy, take a look at xmltodict https://github.com/martinblech/xmltodict xmltodict.parse(xml_str) gets you a dictionary
What are some alternatives?
untangle - Converts XML to Python objects
lxml - The lxml XML toolkit for Python
pyquery - A jquery-like library for python
MarkupSafe - Safely add untrusted strings to HTML/XML markup.
xhtml2pdf - A library for converting HTML into PDFs using ReportLab
selectolax - Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
bleach - Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes
xmldataset - xmldataset: xml parsing made easy 🗃️
html5lib - Standards-compliant library for parsing and serializing HTML documents and fragments in Python