webscraping-open VS pup

Compare webscraping-open vs pup and see what are their differences.

webscraping-open

By reanalytics-databoutique

pup

Parsing HTML at the command line (by ericchiang)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
webscraping-open pup
2 52
- 8,000
- -
- 0.0
- 8 days ago
HTML
- MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

webscraping-open

Posts with mentions or reviews of webscraping-open. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-08-10.
  • Ask HN: What are the best tools for web scraping in 2022?
    33 projects | news.ycombinator.com | 10 Aug 2022
    I’m collecting my experience in using these tools in this “web scraping open knowledge project” on github (https://github.com/reanalytics-databoutique/webscraping-open...) and on my substack (http://thewebscraping.club/) for longer free content
  • Web Scraping Open Knowledge
    9 projects | news.ycombinator.com | 27 May 2022
    On the page about canvas fingerprinting[0], it only mentions Cloudflare. From what I can tell, reCaptcha v3 also uses canvas fingerprinting [1]

    [0] https://github.com/reanalytics-databoutique/webscraping-open...

    [1] https://brianwjoe.com/2019/02/06/how-does-recaptcha-v3-work/

pup

Posts with mentions or reviews of pup. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-06.

What are some alternatives?

When comparing webscraping-open and pup you can also consider the following projects:

Webscraping Open Project - The web scraping open project repository aims to share knowledge and experiences about web scraping with Python [Moved to: https://github.com/TheWebScrapingClub/webscraping-from-0-to-hero]

htmlq - Like jq, but for HTML.

linkedom - A triple-linked lists based DOM implementation.

xidel - Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

undetected-chromedriver - Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)

gron - Make JSON greppable!

docker-selenium-lambda - The simplest demo of chrome automation by python and selenium in AWS Lambda

yq - Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

jq - Command-line JSON processor [Moved to: https://github.com/jqlang/jq]

cascadia - Go cascadia package command line CSS selector

openstates-scrapers - source for Open States scrapers

ddgr - :duck: DuckDuckGo from the terminal