|10 days ago||2 months ago|
|BSD 3-clause "New" or "Revised" License||MIT License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
13 ways to scrape any public data from any website
6 projects | dev.to | 7 Oct 2022
variable.css(".X5PpBb::text").get()  # returns a text value
variable.css(".gs_a").xpath("normalize-space()").get()  # https://github.com/scrapy/parsel/issues/192#issuecomment-1042301716
variable.css(".gSGphe img::attr(srcset)").get()  # returns an attribute value
variable.css(".I9Jtec::text").getall()  # returns a list of string values
variable.xpath('th/text()').get()  # returns a text value using XPath
Web Scraping With Python (An Ultimate Guide)
3 projects | reddit.com/r/Python | 15 Sep 2022
Something I don't see discussed when this topic is brought up is that Scrapy's HTML parsing library, parsel, can be installed separately from scrapy itself. You can use it in place of beautifulsoup and, imo, it's much easier to use.
How to Crawl the Web with Scrapy
7 projects | news.ycombinator.com | 13 Sep 2021
We haven't tracked posts mentioning soupsieve yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
CSS-Minifier - This CSS Minifier tries to reduce the length of code by renaming class names and id names.
parsel-cli - CLI for evaluating CSS and XPath selectors
colly - Elegant Scraper and Crawler Framework for Golang