Is Selenium still a good choice?

This page summarizes the projects mentioned and recommended in the original post on /r/webscraping

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. playwright-pool

    Demonstration on how to use async python to control multiple playwright browsers for web-scraping

    But to summarize it - puppeteer and playwright are superior to Selenium. Mostly because they both have modern, async APIs. When it comes to API itself Playwright is a great choice, though it comes with a lot of default cruft (browser parameters etc) that make scrapers easier to identify. Async support is really important too as there's a lot of IO blocking in browser automation. With async API you can launch multiple asynchronous browser tabs and do something in one while the other is loading - which drastically speeds up web scraping. I published a short demo on github to illustrate this: playwright-pool if you want to learn more about async.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. puppeteer-extra

    💯 Teach puppeteer new tricks through plugins.

    That being said, if you're a beginner Selenium is a much more mature package so it has significantly more resources on StackOverflow and whatnot and Puppeteer has bigger community for avoiding web scraper detection (plugins like puppeteer-extra-plugin-stealth)

  4. scrapy-playwright

    🎭 Playwright integration for Scrapy

    This concern should be lifted if you are a Scrapy lover. There is a Scrapy integration for playwright, that gives you a lot of freedom and lets you operate from a Scrapy spider.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • I built an open source Chrome/Firefox extension that generates Playwright/Puppeteer scripts straight from your browser interactions using React/Shadow DOM

    4 projects | /r/reactjs | 16 Feb 2022
  • Headless recorder is a Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.

    1 project | /r/vuejs | 20 Jul 2021
  • Headless recorder is a Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.

    1 project | /r/programming | 20 Jul 2021
  • Headless recorder is a Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.

    1 project | /r/puppeteer | 20 Jul 2021
  • Headless recorder is a Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.

    1 project | /r/javascript | 20 Jul 2021