Is it possible to scrape a website protected by Cloudflare?

This page summarizes the projects mentioned and recommended in the original post on /r/golang

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • colly

    Elegant Scraper and Crawler Framework for Golang

  • surf

    Stateful programmatic web browsing in Go. (by headzoo)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • go-cloudflare-scraper

    A golang http.Transport layer that uses Otto to solve Cloudflare challenges

  • phantomgo

    a headless browser phantomjs for golang

  • And can't use https://github.com/k4s/phantomgo since it requires a local binary and the backend is living in a serverless provider.

  • CycleTLS

    Spoof TLS/JA3 fingerprints in GO and Javascript

  • A lot of websites nowadays add fingerprint checking. So even if you fake the headers - it wouldn't help, as cloudflare still knows you are making request from go / python / whatever. So there is a lib to spoof fingerptint - https://github.com/Danny-Dasilva/CycleTLS . It may work for you .

  • chromedp

    A faster, simpler way to drive browsers supporting the Chrome DevTools Protocol.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts