goskyr vs colly

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

goskyr		colly
	Project
2	Mentions	39
32	Stars	22,205
-	Growth	1.0%
8.7	Activity	5.7
1 day ago	Latest Commit	13 days ago
Go	Language	Go
GNU General Public License v3.0 only	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

goskyr

Posts with mentions or reviews of goskyr. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-09.

No code command line webscraper
3 projects | /r/webscraping | 9 Mar 2023

I am currently building a webscraper, called goskyr, that can be run from the command line and is supposed to be easily configurable. So instead of having to write code to scrape a website you'd just write a configuration snippet and run the scraper. I realize that there are a number of gui based scraping services that make it extremely easy to setup a scraping process for any website, so for people having no coding experience whatsoever that would probably be the easiest solution. I'm trying to come close to those gui based solutions in terms of functionality by providing a 'smart' way of finding potentially interesting data/fields and letting the user select a subset in a terminal based ui. Also date extraction & parsing and the newly added machine learning capability is probably worth mentioning. Still, those other, gui based solutions are really awesome, eg octoparse or scrapestorm.
Crowdsourced concert scraping project
2 projects | /r/webscraping | 17 May 2022

I am currently working on a configurable command line webscraper, called goskyr and my first use case is collecting as much concert data as possible for this website idea I had, croncert.ch I am hoping that people other than me are willing to contribute to the scraper configuration file in this repository, https://github.com/jakopako/croncert-config, which also contains a github action to regularly run the scraper. What do you think? Could this work? How should I spread the word?

colly

Posts with mentions or reviews of colly. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-01.

Scraping the full snippet from Google search result
3 projects | dev.to | 1 Jan 2024

SerpApi focuses on scraping search results. That's why we need extra help to scrape individual sites. We'll use GoColly package.
Show HN: Flyscrape – A standalone and scriptable web scraper in Go
6 projects | news.ycombinator.com | 11 Nov 2023

Interesting. Can you compare it to colly? [0]
Last time I looked it was the most popular choice for scraping in Go and I have some projects using it.
Is it similar? Does it have more/less features or is it more suited for a different use case? (Which one?)
[0] https://github.com/gocolly/colly
Colly: Elegant Scraper and Crawler Framework for Golang
1 project | news.ycombinator.com | 23 Aug 2023
New modern web crawling tool
2 projects | news.ycombinator.com | 30 Apr 2023

Sounds cool, but how is this different from Colly: https://github.com/gocolly/colly?
colly VS scrapemate - a user suggested alternative
2 projects | 15 Apr 2023
Web Scraping in Python: Avoid Detection Like a Ninja
2 projects | dev.to | 5 Apr 2023

We could write some snippets mixing all these, but the best option in real life is to use a tool with it all, like Scrapy, pyspider, node-crawler (Node.js), or Colly (Go).
Web scraping with Go
5 projects | /r/golang | 2 Apr 2023
Web scraper help
1 project | /r/golang | 1 Mar 2023

Unless you're specifically trying to do it using net/http, I recommend using colly. I've used it in a few scrappers and I love it!
Web Scraping in Golang
2 projects | dev.to | 7 Feb 2023

In this blog, we will be covering the basics of web scraping in Go using the Fiber and Colly frameworks. Colly is an open-source web scraping framework written in Go. It provides a simple and flexible API for performing web scraping tasks, making it a popular choice among Go developers. Colly uses Go's concurrency features to efficiently handle multiple requests and extract data from websites. It offers a wide range of customization options, including the ability to set request headers, handle cookies, follow redirects, and more
Learn how to scrape Trustpilot reviews using Go
4 projects | dev.to | 4 Feb 2023

github.com/gocolly/colly - popular and widely-used library for web scraping in Go. It provides a higher-level API than net/http and makes it easier to extract information from websites. It also provides features such as concurrency, automatic request retries, and support for cookies and sessions.

What are some alternatives?

When comparing goskyr and colly you can also consider the following projects:

croncert-config - configuration and github actions for concertcloud.live (fka croncert.ch), a website that shows you concerts in various cities

GoQuery - A little like that j-thing, only in Go.

fitter - New way for collect information from the API's/Websites

Scrapy - Scrapy, a fast high-level web crawling & scraping framework for Python.

soup - Web Scraper in Go, similar to BeautifulSoup

xpath - XPath package for Golang, supports HTML, XML, JSON document query.

lux - 👾 Fast and simple video download library and CLI tool written in Go

rod - A Devtools driver for web automation and scraping

Rendora - dynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites

Geziyor - Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

Ferret - Declarative web scraping

goskyr vs croncert-config colly vs GoQuery goskyr vs fitter colly vs Scrapy goskyr vs soup colly vs xpath goskyr vs lux colly vs rod goskyr vs Rendora colly vs Geziyor goskyr vs Geziyor colly vs Ferret

Compare goskyr vs colly and see what are their differences.

goskyr

colly

goskyr

colly

What are some alternatives?