Web scraping with Python

This page summarizes the projects mentioned and recommended in the original post on dev.to

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. WolfensteinCGA

    Wolfenstein 3D with a CGA renderer

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. Playwright

    Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

    By definition, Playwright is an open-source framework for web testing and automation developed and maintained by Microsoft.

  4. httpx

    A next generation HTTP client for Python. 🦋

    HTTPX is a fully featured HTTP client library for Python 3, including an integrated command-line client while providing both sync and async APIs.

  5. scrapy-playwright

    🎭 Playwright integration for Scrapy

    To integrate Playwright with Scrapy, we will use the scrapy-playwright library. Then, we will scrape https://www.mintmobile.com/product/google-pixel-7-pro-bundle/ to demonstrate how to extract data from a website using Playwright and Scrapy.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Asynchronous HTTP Requests in Python with HTTPX and asyncio

    1 project | dev.to | 10 Jan 2025
  • Video data IO through ffmpeg subprocess

    1 project | dev.to | 18 Dec 2024
  • HTTPX: Dump requests library in a junkyard 🚀

    2 projects | dev.to | 22 Nov 2024
  • Httpx – next-generation HTTP client for Python

    1 project | news.ycombinator.com | 19 May 2024
  • A Retrospective on Requests

    2 projects | news.ycombinator.com | 27 Feb 2024