|6 months ago||over 1 year ago|
|MIT License||BSD 3-clause "New" or "Revised" License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
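The exact formula behind the activity number is not published here, so the following is only a minimal sketch of how such a score could work. It assumes an exponential decay with a 30-day half-life for commit weights and a 0-10 scale based on the share of tracked projects that rank lower. The function names, the half-life, and the rounding are all illustrative assumptions, not the site's actual method.

```python
from bisect import bisect_left


def weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum commit weights; a commit's weight halves every half_life_days.

    The decay shape and the 30-day half-life are assumptions for illustration.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(project_value, all_values):
    """Map a project's weighted commit total to a relative 0-10 score.

    The score is 10 times the share of tracked projects with a strictly
    lower total, so 9.0 means 90% of projects rank below this one.
    """
    ranked = sorted(all_values)
    below = bisect_left(ranked, project_value)
    return round(10.0 * below / len(ranked), 1)


# Hypothetical data: 100 tracked projects with weighted totals 0..99.
tracked = list(range(100))
score = activity_score(90, tracked)
```

Under these assumptions, a project with 90 of 100 tracked projects below it scores 9.0, which matches the "top 10%" reading of the example above.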
Ask HN: What are some tools / libraries you built yourself?
news.ycombinator.com | 2021-05-16
I've been working on gazpacho for the last two years.
It's a general-purpose web scraping library for Python that replaces BeautifulSoup + requests for most projects.
It just surpassed ~2K downloads per week!
We haven't tracked posts mentioning xmldataset yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
lxml - The lxml XML toolkit for Python
MarkupSafe - Safely add untrusted strings to HTML/XML markup.
xmltodict - Python module that makes working with XML feel like you are working with JSON
html5lib - Standards-compliant library for parsing and serializing HTML documents and fragments in Python
xhtml2pdf - A library for converting HTML into PDFs using ReportLab
selectolax - Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
untangle - Converts XML to Python objects
bleach - Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes