curl-impersonate vs colly

curl-impersonate	Project	colly
31	Mentions	39
3,319	Stars	22,165
-	Growth	1.8%
7.1	Activity	6.0
about 2 months ago	Latest Commit	9 days ago
Python	Language	Go
MIT License	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
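
As a rough illustration of the Growth figure, here is a minimal sketch that assumes "month over month growth" means the ordinary percent change between two monthly star counts. The page does not give its exact formula, so the function name and the snapshot values below are hypothetical.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Percent change in stars between two monthly snapshots.

    Assumption: growth is the plain percent change; the site's exact
    formula is not stated on this page.
    """
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100.0


# Hypothetical snapshots, chosen only to illustrate the arithmetic.
growth = month_over_month_growth(previous_stars=20_000, current_stars=20_360)
print(f"{growth:.1f}%")
```

Under this assumption, a project listed with "-" for Growth would simply have no earlier snapshot to compare against.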

curl-impersonate

Posts with mentions or reviews of curl-impersonate. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-27.

Recent 'MFA Bombing' Attacks Targeting Apple Users
2 projects | news.ycombinator.com | 27 Mar 2024

> us[e] Akamai to block scraping
Would https://github.com/lwthiker/curl-impersonate help? Haven’t tried with Akamai, but did help with another widely used CDN that shall remain unnamed (but has successfully infused me with burning hate for their products after a couple of years’ worth of using an always-on VPN to bypass Internet censorship and/or a slightly unusual browser).
Curl-impersonate: Mimic real browsers' TLS handshake with curl
1 project | news.ycombinator.com | 8 Sep 2023
Get RSS feed for your Ko-Fi account
2 projects | dev.to | 17 Aug 2023

But before that, I had to create a development environment where I could do the coding. I used Docker and created a docker-compose.yml file on my local system to build a container. At first, I did that on an Arm based computer and the first problem appeared. Although RSS-Bridge was working fine, I couldn't get any data, and the reason was that Ko-Fi.com uses Cloudflare CDN. This is something that a lot of people had issues with in the past. RSS-Bridge solves that problem by using a special build of curl that can impersonate the four major browsers: Chrome, Edge, Safari & Firefox. But unfortunately, that library doesn't work well on Arm-based systems, so I had to move to my trusty Intel-based Linux computer.
curl-impersonate VS curl-impersonate-php - a user suggested alternative
2 projects | 2 Aug 2023
Found a way to bypass Cloudflare 403 forbidden in cURL, fetch
2 projects | /r/webscraping | 2 Jul 2023

Curl-Impersonate: https://github.com/lwthiker/curl-impersonate A special build of curl that can impersonate Chrome & Firefox
Weird API behavior: Only Postman and browser consistently work but making same request with requests library gets a Captcha instead.
2 projects | /r/webscraping | 22 Jun 2023
Web fingerprinting is worse than I thought
5 projects | news.ycombinator.com | 21 Mar 2023

I haven’t seen a custom build of Wget, but for Curl there is curl-impersonate[1].
[1] https://github.com/lwthiker/curl-impersonate
Using selenium with proxy still hit bot detection
2 projects | /r/webscraping | 16 Jan 2023
Devirtualizing Nike.com's Bot Protection (Part 1)
4 projects | news.ycombinator.com | 7 Jan 2023
Bypassing University Internet Restrictions for Legal Purposes (to access my homeservers/raspberry Pis/VPS)
2 projects | /r/selfhosted | 20 Dec 2022

If you're just trying to pull a file, the curl-impersonate could be a low-effort option.
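
The posts above describe curl-impersonate as a special build of curl that is invoked in place of the regular binary. A minimal sketch of calling it from a script follows. It assumes a wrapper script named `curl_chrome116` is installed and on `PATH`; wrapper names vary by release, and the helper names `build_impersonate_command` and `fetch` are hypothetical.

```python
import shutil
import subprocess


def build_impersonate_command(url: str, wrapper: str = "curl_chrome116") -> list[str]:
    """Build the argument list for a curl-impersonate wrapper script.

    Assumption: `wrapper` names a script shipped with the installed
    curl-impersonate release; check your installation for the real names.
    """
    # -s silences the progress meter and -L follows redirects (standard curl flags).
    return [wrapper, "-s", "-L", url]


def fetch(url: str, wrapper: str = "curl_chrome116") -> str:
    """Fetch `url` through the wrapper and return the response body as text."""
    if shutil.which(wrapper) is None:
        raise FileNotFoundError(f"{wrapper} not found on PATH")
    result = subprocess.run(
        build_impersonate_command(url, wrapper),
        capture_output=True,
        text=True,
        check=True,   # raise CalledProcessError on a non-zero curl exit code
        timeout=30,
    )
    return result.stdout
```

Calling `fetch("https://example.com")` would then return the page body, provided the wrapper is installed. Whether a given CDN accepts the request still depends on the site, as the posts above note.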

colly

Posts with mentions or reviews of colly. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-01.

Scraping the full snippet from Google search result
3 projects | dev.to | 1 Jan 2024

SerpApi focuses on scraping search results. That's why we need extra help to scrape individual sites. We'll use GoColly package.
Show HN: Flyscrape – A standalone and scriptable web scraper in Go
6 projects | news.ycombinator.com | 11 Nov 2023

Interesting. Can you compare it to colly? [0]
Last time I looked it was the most popular choice for scraping in Go and I have some projects using it.
Is it similar? Does it have more/less features or is it more suited for a different use case? (Which one?)
[0] https://github.com/gocolly/colly
Colly: Elegant Scraper and Crawler Framework for Golang
1 project | news.ycombinator.com | 23 Aug 2023
New modern web crawling tool
2 projects | news.ycombinator.com | 30 Apr 2023

Sounds cool, but how is this different from Colly: https://github.com/gocolly/colly?
colly VS scrapemate - a user suggested alternative
2 projects | 15 Apr 2023
Web Scraping in Python: Avoid Detection Like a Ninja
2 projects | dev.to | 5 Apr 2023

We could write some snippets mixing all these, but the best option in real life is to use a tool with it all, like Scrapy, pyspider, node-crawler (Node.js), or Colly (Go).
Web scraping with Go
5 projects | /r/golang | 2 Apr 2023
Web scraper help
1 project | /r/golang | 1 Mar 2023

Unless you're specifically trying to do it using net/http, I recommend using colly. I've used it in a few scrapers and I love it!
Web Scraping in Golang
2 projects | dev.to | 7 Feb 2023

In this blog, we will be covering the basics of web scraping in Go using the Fiber and Colly frameworks. Colly is an open-source web scraping framework written in Go. It provides a simple and flexible API for performing web scraping tasks, making it a popular choice among Go developers. Colly uses Go's concurrency features to efficiently handle multiple requests and extract data from websites. It offers a wide range of customization options, including the ability to set request headers, handle cookies, follow redirects, and more.
Learn how to scrape Trustpilot reviews using Go
4 projects | dev.to | 4 Feb 2023

github.com/gocolly/colly - popular and widely-used library for web scraping in Go. It provides a higher-level API than net/http and makes it easier to extract information from websites. It also provides features such as concurrency, automatic request retries, and support for cookies and sessions.

What are some alternatives?

When comparing curl-impersonate and colly you can also consider the following projects:

curl_cffi - Python binding for curl-impersonate via cffi. A http client that can impersonate browser tls/ja3/http2 fingerprints.

GoQuery - A little like that j-thing, only in Go.

challenge-bypass-extension - DEPRECATED - Client for Privacy Pass protocol providing unlinkable cryptographic tokens

Scrapy - Scrapy, a fast high-level web crawling & scraping framework for Python.

puppeteer - Node.js API for Chrome

xpath - XPath package for Golang, supports HTML, XML, JSON document query.

SendWhatsppTextByJavaScript - Here is small JS Script for sending a message in a loop.

rod - A Devtools driver for web automation and scraping

static-curl - fully static builds of curl, runs anywhere

Geziyor - Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

browsercookie

Ferret - Declarative web scraping
