Avoiding bot detection: How to scrape the web without getting blocked?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  1. browser-fingerprinting

    Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

    Try your technique on a few of these fingerprint testing sites: https://github.com/niespodd/browser-fingerprinting#fingerpri... I'm pretty sure it's quite detectable.

  3. duo-bypass

    Stuff after reverse engineering DUO's mobile app.

    There are myriad ways of extracting the TOTP seed from these apps... Or you just reverse engineer the setup/confirmation process and then you can generate/trigger your own tokens from your automation workflow.

    2FA is a good security feature, but it does not help against web scraping. Credential stuffing and other third-party attacks? Yes, it _can_ help, but not always. There's a phishing group that has seemingly specialised in getting people to click the green confirm button in their Duo app... ¯\_(ツ)_/¯

    Check https://github.com/revalo/duo-bypass for a Python script that can be used to automate Duo tokens... It includes some code from me. There are similar scripts for all the other well-known OTP apps...
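
The comment above talks about generating tokens yourself once the seed is known. As a rough illustration of what that involves, here is a minimal sketch of the standard HOTP/TOTP algorithms (RFC 4226 and RFC 6238) using only the Python standard library. It is not taken from the duo-bypass repository, and the function names `hotp` and `totp` are made up for this example. It assumes you already hold the raw secret bytes; seeds are often distributed base32-encoded, in which case they would need decoding first.

```python
import hashlib
import hmac
import struct
import time


def hotp(secret: bytes, counter: int, digits: int = 6) -> str:
    """Counter-based one-time password as described in RFC 4226."""
    # HMAC-SHA1 over the counter encoded as an 8-byte big-endian integer.
    mac = hmac.new(secret, struct.pack(">Q", counter), hashlib.sha1).digest()
    # Dynamic truncation: the low nibble of the last byte selects a 4-byte window.
    offset = mac[-1] & 0x0F
    code = struct.unpack(">I", mac[offset:offset + 4])[0] & 0x7FFFFFFF
    return str(code % 10 ** digits).zfill(digits)


def totp(secret: bytes, at=None, step: int = 30, digits: int = 6) -> str:
    """Time-based one-time password (RFC 6238): HOTP over the current time step."""
    now = time.time() if at is None else at
    return hotp(secret, int(now // step), digits)
```

With the ASCII test secret from the RFCs, `hotp(b"12345678901234567890", 0)` gives `"755224"`. A real app may use a different step, digit count, or hash function, so treat the defaults here as assumptions.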

  4. quaid

    A single-page webapp that decrypts text using only client-side JavaScript

    This utility will help with that, assuming the services that use 2FA have a backup-code feature: https://github.com/sowbug/quaid

  5. undetected-chromedriver

    Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / CloudFlare IUAM)

    I've had a lot of success just with Selenium and this custom version of Chromedriver: https://github.com/ultrafunkamsterdam/undetected-chromedrive...
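
For readers who have not used it, a minimal sketch of driving a page with this Chromedriver build might look like the following. It assumes the `undetected-chromedriver` package and a local Chrome are installed. The helper name `fetch_title` and its `driver_factory` parameter are hypothetical, added only so the function can be tried with a stand-in driver. Whether a given site still detects the browser is not something this sketch can promise.

```python
def fetch_title(url, driver_factory=None):
    """Open `url` in a browser session and return the page title."""
    if driver_factory is None:
        # Imported lazily so the helper can be exercised without Chrome installed.
        import undetected_chromedriver as uc
        driver_factory = uc.Chrome
    driver = driver_factory()
    try:
        driver.get(url)       # same call as with a regular Selenium WebDriver
        return driver.title
    finally:
        driver.quit()         # always release the browser process
```

Calling `fetch_title("https://example.com")` with no factory uses `uc.Chrome()`, which is intended as a drop-in replacement for Selenium's Chrome driver.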

  7. Playwright

    Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

    Playwright is easy to get started with. There are even tools that allow you to record your browser actions and convert them into code (https://playwright.dev/).
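
To give a feel for the "single API" point, here is a small sketch using Playwright's synchronous Python API. It assumes the `playwright` package and its browsers are installed. The helper name `page_title` and the `playwright_factory` parameter are hypothetical, included only so the function can be tested with stand-in objects.

```python
def page_title(url, browser_name="chromium", playwright_factory=None):
    """Load `url` in the named browser engine and return the page title."""
    if playwright_factory is None:
        # Imported lazily so the helper can be exercised without Playwright installed.
        from playwright.sync_api import sync_playwright
        playwright_factory = sync_playwright
    with playwright_factory() as p:
        # browser_name may be "chromium", "firefox" or "webkit".
        browser = getattr(p, browser_name).launch()
        try:
            page = browser.new_page()
            page.goto(url)
            return page.title()
        finally:
            browser.close()
```

Switching engines is just a matter of passing a different `browser_name`. The recording tool mentioned above is available from the command line as `playwright codegen`, which opens a browser and emits code for the actions you perform.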

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Rebrowser Patches – Patches for undetectable browser automation

    1 project | news.ycombinator.com | 25 Apr 2025
  • Experiência dos candidatos numa vaga Sênior

    1 project | /r/brdev | 8 May 2023
  • undetected-chromedriver VS Selenium-Profiles - a user suggested alternative

    2 projects | 26 Mar 2023
  • Devirtualizing Nike.com's Bot Protection (Part 1)

    4 projects | news.ycombinator.com | 7 Jan 2023
  • Is there an easy way to tell if a website will allow scrapers or not?

    2 projects | /r/webscraping | 5 Oct 2022