Top 5 Go Webscraping Projects
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
musicScraper
CLI tool for scraping information from musical websites (Rateyourmusic, Metal Archives), with nice album ASCII art
-
goquiz
Provide the scrapers to get the engineering multiple-choice questions across the sites, structure the data, populate the database, and serve API endpoints to retrieve the questions.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Project mention: My hobby project. musicScrapper - CLI tool for scraping information from musical websites (Rateyourmusic, Metal Archives), with nice album ASCII art | /r/golang | 2023-07-04Github
I've found myself writing the same session/proxy/rate limiting/header faking management code over and over for my scrapers. I've extracted it into it's own service that runs in docker and acts as a MITM proxy between you and target. It is client language agnostic, so you can write scrapers in python, node or whatever and still have great performance.
Highly recommend this approach, it allows you to separate infrastructure code, that gets highly complex as you need more requests, from actual spider/parser code that is usually pretty straightforward and project specific.
https://github.com/jkelin/forward-proxy-manager
Go Webscraping related posts
Index
What are some of the best open-source Webscraping projects in Go? This list will help you:
Project | Stars | |
---|---|---|
1 | soup | 2,126 |
2 | goskyr | 32 |
3 | musicScraper | 16 |
4 | forward-proxy-manager | 5 |
5 | goquiz | 2 |
Sponsored