Can you suggest something more to grow in scraping?

This page summarizes the projects mentioned and recommended in the original post on /r/webscraping

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • detect-headless

    Access https://infosimples.github.io/detect-headless to run several headless detection tests against your browser.

  • Do you know how to detect headless browser? For example, you can check your bot here: https://github.com/infosimples/detect-headless. Try to make your bot undetectable.

  • Cerberus

    Lightweight, extensible data validation library for Python (by pyeve)

  • Other than that, have you looked into testing scrapers? Since scrapers are working with highly dynamic data writing good tests is quite a challange. For example, for parser monitoring using cerberus is a very cool tool which allows you to define loose requirements like "phone number should always be 9 numbers" etc.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts