goskyr vs Geziyor

Project         goskyr                                 Geziyor
Mentions        2                                      2
Stars           32                                     2,480
Growth          -                                      0.4%
Activity        8.7                                    0.6
Latest Commit   2 days ago                             7 months ago
Language        Go                                     Go
License         GNU General Public License v3.0 only   Mozilla Public License 2.0

Mentions is the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars is the number of stars that a project has on GitHub. Growth is the month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

goskyr

Posts with mentions or reviews of goskyr. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-09.

No code command line webscraper
3 projects | /r/webscraping | 9 Mar 2023

I am currently building a webscraper, called goskyr, that can be run from the command line and is supposed to be easily configurable. So instead of having to write code to scrape a website, you'd just write a configuration snippet and run the scraper. I realize that there are a number of GUI-based scraping services that make it extremely easy to set up a scraping process for any website, so for people with no coding experience whatsoever that would probably be the easiest solution. I'm trying to come close to those GUI-based solutions in terms of functionality by providing a 'smart' way of finding potentially interesting data/fields and letting the user select a subset in a terminal-based UI. Date extraction and parsing and the newly added machine learning capability are probably also worth mentioning. Still, those other, GUI-based solutions are really awesome, e.g. octoparse or scrapestorm.

Crowdsourced concert scraping project
2 projects | /r/webscraping | 17 May 2022

I am currently working on a configurable command line webscraper called goskyr, and my first use case is collecting as much concert data as possible for a website idea I had, croncert.ch. I am hoping that people other than me are willing to contribute to the scraper configuration file in this repository, https://github.com/jakopako/croncert-config, which also contains a GitHub Action to regularly run the scraper. What do you think? Could this work? How should I spread the word?

Geziyor

Posts with mentions or reviews of Geziyor. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-13.

Show HN: I scraped 25M Shopify products to build a search engine
4 projects | news.ycombinator.com | 13 Dec 2023

As someone who has scraped millions of items myself, I had success using Geziyor (https://github.com/geziyor/geziyor) built in Go. Shopify sites are especially easy to scrape because they tend to share the same product data formatting and don't hide it behind JS rendering.
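
The point about uniform product data can be illustrated with a minimal Go sketch. It assumes a storefront returns a JSON document with a top-level "products" array whose entries carry "id", "title", "handle", and "variants" fields; those field names, the sample payload, and the helper parseProducts are illustrative assumptions, not something stated in the post and not part of Geziyor's API. When the data arrives in one predictable shape, a single set of structs can decode it with no browser rendering step.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Variant and Product mirror a small, assumed subset of the fields in a
// Shopify-style product listing; real payloads may differ and carry more fields.
type Variant struct {
	Title string `json:"title"`
	Price string `json:"price"` // assumed to arrive as a decimal string
}

type Product struct {
	ID       int64     `json:"id"`
	Title    string    `json:"title"`
	Handle   string    `json:"handle"`
	Variants []Variant `json:"variants"`
}

// productPage is the assumed top-level wrapper around the product list.
type productPage struct {
	Products []Product `json:"products"`
}

// sampleJSON is hand-written illustrative data, not a real store response.
const sampleJSON = `{"products":[{"id":1,"title":"Example Mug","handle":"example-mug","variants":[{"title":"Default","price":"12.50"}]}]}`

// parseProducts (hypothetical helper) decodes one page of products.
func parseProducts(data []byte) ([]Product, error) {
	var page productPage
	if err := json.Unmarshal(data, &page); err != nil {
		return nil, fmt.Errorf("decode product page: %w", err)
	}
	return page.Products, nil
}

func main() {
	products, err := parseProducts([]byte(sampleJSON))
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	// Print one line per variant of each decoded product.
	for _, p := range products {
		for _, v := range p.Variants {
			fmt.Printf("%s (%s): %s\n", p.Title, v.Title, v.Price)
		}
	}
}
```

In a real scraper the byte slice would come from an HTTP response body rather than a constant, and the structs would need checking against what a given store actually serves.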

Show HN: Flyscrape – A standalone and scriptable web scraper in Go
6 projects | news.ycombinator.com | 11 Nov 2023

It's been 8+ years since I started scraping. I even wrote a popular Go web scraping framework previously (https://github.com/geziyor/geziyor).
These days I'm not even using Go for scraping, as webpage changes drive me crazy, so I moved to TypeScript+Playwright. (The Crawlee framework is cool, though not strictly necessary.)
My favorite stack as of 2023: TypeScript+Playwright+Crawlee (optional)

What are some alternatives?

When comparing goskyr and Geziyor you can also consider the following projects:

croncert-config - configuration and github actions for concertcloud.live (fka croncert.ch), a website that shows you concerts in various cities

colly - Elegant Scraper and Crawler Framework for Golang

Pholcus - Pholcus is a distributed high-concurrency crawler software written in pure golang

fitter - A new way to collect information from APIs and websites

jsonrpconion - Library for building JSON RPC services on Tor network

soup - Web Scraper in Go, similar to BeautifulSoup

Ferret - Declarative web scraping

lux - 👾 Fast and simple video download library and CLI tool written in Go

google-search-results-golang - Google Search Results GoLang API

Rendora - dynamic server-side rendering using headless Chrome to effortlessly solve the SEO problem for modern javascript websites

gichidan - Gichidan - CLI wrapper for Ichidan deep-web search engine.
