SaaSHub helps you find the best software and product alternatives Learn more →
Top 16 Python Playwright Projects
-
Project mention: Google and Anthropic are working on AI agents - so I made an open source alternative | dev.to | 2025-01-09
Integrating Ollama, Microsoft vision models and Playwright I've made a simple agent that can browse websites and data to answer your query.
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
By the end of this blog, we'll explore three different ways to extract data from Crunchbase using Crawlee for Python. We'll fully implement two of them and discuss the specifics and challenges of the third. This will help us better understand how important it is to properly choose the right data source.
-
I actually built my own Playwright screenshotting software with this idea in mind too: https://shot-scraper.datasette.io/ - I wrote about using that for my project documentation here: https://simonwillison.net/2022/Oct/14/automating-screenshots...
Really it comes down to the team you are working with. If you have user-facing documentation authors who are happy with Markdown and Git you can probably get this to work.
-
Project mention: Current problems and mistakes of web scraping in Python and tricks to solve them! | dev.to | 2024-08-22
Middleware libraries are written by the community and are extending their functionality. For example, scrapy-playwright.
-
-
Botright
Botright, the most advance undetected, fingerprint-changing, captcha-solving, open-source automation framework. Build on Playwright, its as easy to use as it is to extend your code. Solving your Captchas for free with AI.
-
agentql
AgentQL is an AI-powered query language for web scraping and automation. It uses natural language selectors to find data on any page, including authenticated content. AgentQL queries are self-healing as UI changes and work across similar sites. Users can define structured data output, making AgentQL versatile for developers and data scientists.
We upgraded Stealth Mode to minimize bot detection when scraping or automating actions on third-party websites. Check out the launch post, this handy guide to Avoiding Bot Detection with Stealth Mode, or implement it today with our Stealth Mode Example Script (now in JavaScript, too!)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Project mention: Implementing BDD with `pytest-bdd` and `pytest-playwright` for Web Testing | dev.to | 2024-06-30
pytest-playwright Documentation: pytest-playwright
-
dude
dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators
-
testzeus-hercules
Welcome to Hercules, the world's first open-source testing agent that's here to lift your testing burdens with the strength of a mythological hero.
Project mention: Show HN: I used DeepSeek and TestZeus to do agentic testing | news.ycombinator.com | 2025-01-28 -
linkedIn-scraper
A playwright bot which is implemented to scrape linkedin and store advertisement data in a database and telegram channel
-
-
Project mention: Show HN: Iauto – Low-Code automation tool that integrates LLM and RPA | news.ycombinator.com | 2024-02-25
-
Daraz-Global-WebScraper
A web scraper that scrape all the product links, names and prices from Daraz global (Nepal, Sri Lanka, Pakistan, Bangladesh)
-
Project mention: Show HN: Agent.exe, the easiest way to let 3.5 Sonnet control your machine | news.ycombinator.com | 2024-10-23
I built something similar (still no GUI) but for the in browser actions only,
I think in-browser actions are much safer and can be more predictable with easier to implement safeguards, but I would love to see how this concept pan out in the future!
PS: you can check it out on GitHub: https://github.com/SamDc73/WebTalk/
Please let me know what you guys think!
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Playwright discussion
Python Playwright related posts
-
Google and Anthropic are working on AI agents - so I made an open source alternative
-
AgentQL Launch Week Recap—make the web AI-ready
-
Stealth Mode—Enhanced Bot Detection Evasion—Launch week day 3
-
Show HN: Agent.exe, the easiest way to let 3.5 Sonnet control your machine
-
Docs as Code
-
Announcing Crawlee Python: Now you can use Python to build reliable web crawlers
-
Show HN: Tarsier – vision for text-only LLM web agents that beats GPT-4o
-
A note from our sponsor - SaaSHub
www.saashub.com | 9 Feb 2025
Index
What are some of the best open-source Playwright projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | playwright-python | 12,304 |
2 | crawlee-python | 5,221 |
3 | shot-scraper | 1,782 |
4 | scrapy-playwright | 1,093 |
5 | hrequests | 765 |
6 | Botright | 584 |
7 | agentql | 465 |
8 | playwright-pytest | 455 |
9 | dude | 426 |
10 | testzeus-hercules | 282 |
11 | linkedIn-scraper | 236 |
12 | solverecaptchas | 104 |
13 | iauto | 84 |
14 | Daraz-Global-WebScraper | 9 |
15 | WebTalk | 2 |
16 | examples | 1 |