undetected-chromedriver vs pup

undetected-chromedriver

Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM) (by ultrafunkamsterdam)

Source Code

github.com

Suggest alternative

Edit details

pup

Parsing HTML at the command line (by ericchiang)

Suggest topics

Source Code

Suggest alternative

Edit details

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

undetected-chromedriver		pup
	Project
40	Mentions	52
8,066	Stars	7,998
-	Growth	-
7.1	Activity	0.0
16 days ago	Latest Commit	about 1 month ago
Python	Language	HTML
GNU General Public License v3.0 only	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

undetected-chromedriver

Posts with mentions or reviews of undetected-chromedriver. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-08.

ad_clicker premium - Google/Bing Ads Clicker
2 projects | /r/IMadeThis | 8 Dec 2023

This command-line tool clicks ads for a certain query on Google/Bing search using undetected_chromedriver package. Supports proxy, running multiple simultaneous browsers, ad targeting/exclusion, and running in loop.
Getting an image from Nascar.com
1 project | /r/learnpython | 3 Jul 2023
Which Web Browser automation tool is the best?
1 project | /r/webscraping | 25 Jun 2023

You can check this out. https://github.com/ultrafunkamsterdam/undetected-chromedriver If i didn't understand you wrong then this is what you're asking for.
how to scrape this news website
1 project | /r/webscraping | 2 Jun 2023

403 often means that the server recognized the scraper and blocked you. If you use Selenium, this plugin is very good for passing bot detection: https://github.com/ultrafunkamsterdam/undetected-chromedriver.
🚀 Introducing ✨ Bose Framework - The Swiss Army Knife for Bot Developers 🤖
3 projects | dev.to | 24 May 2023

Ultrafunkamsterdam created a ChromeDriver that has excellent support for bypassing all major bot detection systems such as Distil, Datadome, Cloudflare, and others.
Craigslist
1 project | /r/webscraping | 29 Apr 2023

One solution would be to install Selenium and then scrape using a real browser like Chrome. If this solution gets blocked, you could install obfuscation plugins like this very good one: https://github.com/ultrafunkamsterdam/undetected-chromedriver
How to Avoid Bot Detection with Selenium
2 projects | dev.to | 14 Apr 2023

Undetected_ChromeDriver also works on Brave Browser and many other Chromium-based browsers. For more, you can check out this project on GitHub.
Thread Diario de Dudas, Consultas y Mitaps - 31/03
1 project | /r/argentina | 31 Mar 2023
undetected-chromedriver VS Selenium-Profiles - a user suggested alternative
2 projects | 26 Mar 2023
What is this I don't even... ('Undetected' Chromedriver?)
1 project | /r/Automate | 26 Mar 2023

pup

Posts with mentions or reviews of pup. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-06.

script to download some notes
1 project | /r/danklinuxusers | 10 Mar 2023

And lnk=$(curl -s https://www.selfstudys.com$url |grep "PDFFlip" | cut -d '"' -f 6) to lnk=$(curl -s https://www.selfstudys.com$url | pup "div#PDFF attr{source}" ) here pup will print content of source attribute from div tag with id PDFF i dont know that much about html & css so this is what i came up with. but i am sure you can also select class & make list of suburls from them. check out the video from bugswriter on pup or read docs from git hub for more info github link: https://github.com/ericchiang/pup
What monitoring tool do you use or recommend?
5 projects | /r/selfhosted | 6 Mar 2023

jq is pretty amazing. If you are comfortable with its jquery-like CSS selector syntax, then I should also mention a couple similar cli utilities that apply it to HTML: htmlp and pup.
Creating a data scraper as a beginner?
1 project | /r/learnprogramming | 2 Mar 2023

Regex is not a great tool for parsing web pages. Open up a browser dev tools window and select a bit of the page. Right click > copy... XPath expression or CSS selector. A proper web scraping tool will accept either of those. No muss, no fuss. You can even use simple command line tools: xpath or pup
December 5, 2022: FLiP Stack Weekly
17 projects | dev.to | 3 Dec 2022
Show HN: A tool like jq, but for parsing HTML
2 projects | news.ycombinator.com | 3 Dec 2022

This is HTML to JSON, written in Rust, and there's also pup[1] which I found out about just the other day on HN[2] which uses a very similar syntax (CSS selectors) but outputs HTML and is written in Go.
I can see room for both though it would interesting to have a more detailed comparison to go on (e.g. types of HTML, speed etc).
[1] https://github.com/ericchiang/pup
[2] https://news.ycombinator.com/item?id=33805732
Pup: Parsing HTML at the command line
1 project | /r/patient_hackernews | 30 Nov 2022

1 project | /r/hackernews | 30 Nov 2022
pup: Parsing HTML at the Command Line
1 project | /r/hypeurls | 30 Nov 2022

7 projects | news.ycombinator.com | 30 Nov 2022

It looks like the project became inactive for a bit and there are alternatives such as htmlq, etc. https://github.com/ericchiang/pup/issues/150
Converting field before delimiter to uppercase and how to replace with multiple newlines
1 project | /r/bash | 18 Nov 2022

Another tool worth mentioning is pup - it can produce JSON output which means you can pipe it to jq

What are some alternatives?

When comparing undetected-chromedriver and pup you can also consider the following projects:

selenium-python-helium - Lighter web automation for Python [Moved to: https://github.com/mherrmann/helium]

htmlq - Like jq, but for HTML.

Playwright - Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

xidel - Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

browser-fingerprinting - Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

gron - Make JSON greppable!

scrapy-cloudflare-middleware - A Scrapy middleware to bypass the CloudFlare's anti-bot protection

yq - Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

helium - Selenium-python but lighter: Helium is the best Python library for web automation. [Moved to: https://github.com/mherrmann/selenium-python-helium]

cascadia - Go cascadia package command line CSS selector

sillynium - Automate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements

ddgr - :duck: DuckDuckGo from the terminal

undetected-chromedriver vs selenium-python-helium pup vs htmlq undetected-chromedriver vs Playwright pup vs xidel undetected-chromedriver vs browser-fingerprinting pup vs gron undetected-chromedriver vs scrapy-cloudflare-middleware pup vs yq undetected-chromedriver vs helium pup vs cascadia undetected-chromedriver vs sillynium pup vs ddgr

Compare undetected-chromedriver vs pup and see what are their differences.

undetected-chromedriver

pup

undetected-chromedriver

pup

What are some alternatives?