|2 months ago||3 months ago|
|GNU General Public License v3.0 only||BSD 3-clause "New" or "Revised" License|
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Web Scraping With Python (An Ultimate Guide)
3 projects | reddit.com/r/Python | 15 Sep 2022
I like it so much that I even wrote a REPL for it, parsel-cli :) (it's a bit of a Frankenstein, though, as I'm working on a 2.0 release)
What does the process of web scraping actually look like?
2 projects | reddit.com/r/webscraping | 24 Apr 2022
For that I use my own little tool called parsel-cli, which lets you quickly test parsing expressions on live web pages.
We haven't tracked posts mentioning pyquery yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
lxml - The lxml XML toolkit for Python
xmltodict - Python module that makes working with XML feel like you are working with JSON
selectolax - Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
html5lib - Standards-compliant library for parsing and serializing HTML documents and fragments in Python
xhtml2pdf - A library for converting HTML into PDFs using ReportLab
MarkupSafe - Safely add untrusted strings to HTML/XML markup.
untangle - Converts XML to Python objects
gazpacho - 🥫 The simple, fast, and modern web scraping library
xmldataset - xmldataset: xml parsing made easy 🗃️
bleach - Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes
enaml-web - Build interactive websites with enaml