13 ways to scrape any public data from any website

This page summarizes the projects mentioned and recommended in the original post on dev.to

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. parsel

    Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

    variable.css(".X5PpBb::text").get() # returns a text value variable.css(".gs_a").xpath("normalize-space()").get() # https://github.com/scrapy/parsel/issues/192#issuecomment-1042301716 variable.css(".gSGphe img::attr(srcset)").get() # returns a attribute value variable.css(".I9Jtec::text").getall() # returns a list of strings values variable.xpath('th/text()').get() # returns text value using xpath

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. Scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    scrapy is a high-level webscraping framework designed to scrape data at scale and can be used to create a whole ETL pipeline.

  4. playwright-python

    Python version of the Playwright testing and automation library.

    playwright is a modern alternative to selenium. It can perform pretty much all interations as user would do i.e clicks, scrolls and many more.

  5. Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  6. lxml

    The lxml XML toolkit for Python

    Parsel is a library build to extract data from XML/HTML documents with XPath and CSS selectors support, and could be combined with regular expressions. It's usees lxml parser under the hood by default.

  7. insomnia

    The open-source, cross-platform API client for GraphQL, REST, WebSockets, SSE and gRPC. With Cloud, Local and Git storage.

    Copy URL as CURL (Bash) and use it with online CURL runner or tools such as Insomnia.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Where to start: Learning Web-scraping

    2 projects | /r/learnpython | 26 Jun 2021
  • I would like to scrape all posts in a subreddit?

    2 projects | /r/hacking | 8 Apr 2021
  • How to tell if a page uses JavaScript rendering (and what to do about it)

    2 projects | dev.to | 11 May 2026
  • How to Use rs-trafilatura with Scrapy

    1 project | dev.to | 3 Apr 2026
  • Best Python Web Scraping Libraries 2026

    1 project | dev.to | 11 Jan 2026

Did you know that Python is
the 1st most popular programming language
based on number of references?