Go Scraping

Open-source Go projects categorized as Scraping

Top 12 Go Scraping Projects

  • colly

    Elegant Scraper and Crawler Framework for Golang

    Project mention: Show HN: Flyscrape – A standalone and scriptable web scraper in Go | news.ycombinator.com | 2023-11-11

    Interesting. Can you compare it to colly? [0]

    Last time I looked it was the most popular choice for scraping in Go and I have some projects using it.

    Is it similar? Does it have more/less features or is it more suited for a different use case? (Which one?)

    [0] https://github.com/gocolly/colly

  • Ferret

    Declarative web scraping

  • Geziyor

    Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

    Project mention: Show HN: Flyscrape – A standalone and scriptable web scraper in Go | news.ycombinator.com | 2023-11-11

    It's been 8+ years since I started scraping. I even wrote a popular Go web scraping framework previously (https://github.com/geziyor/geziyor).

    These days, I'm not even using Go for scraping, as webpage changes drive me crazy, so I moved to TypeScript+Playwright. (The Crawlee framework is cool, though not strictly necessary.)

    My favorite stack as of 2023: TypeScript+Playwright+Crawlee(Optional)

  • till

    DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any scraper in 5 minutes.

  • Dataflow kit

    Extract structured data from websites. Website scraping.

  • antch

    Antch, a fast, powerful and extensible web crawling & scraping framework for Go

  • newser

    Newser is a simple utility to generate a PDF with your favorite news articles

  • goclone

    🌱 goclone - clone websites in a matter of seconds

    Project mention: Show HN: Goclone – your ultimate tool for offline web browsing | news.ycombinator.com | 2023-07-31

  • xdsl-exporter

    xDSL Prometheus Exporter

    Project mention: I created Prometheus Exporter with Go to scrape my xDSL Modem stats | /r/golang | 2023-03-17

  • goGetJS

    a tool for extracting, searching, and saving JavaScript files (with optional headless browser)

  • moviestills

    A small CLI app to scrape high-quality movie snapshots from various websites.

  • go-scraper

    This repo shows how to scrape different types of data

    Project mention: How to scrape different types of data in Golang Using Colly | dev.to | 2023-01-29

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-11-11.

Index

What are some of the best open-source Scraping projects in Go? This list will help you:

#    Project         Stars
1    colly           21,202
2    Ferret          5,509
3    Geziyor         2,294
4    till            803
5    Dataflow kit    623
6    antch           252
7    newser          83
8    goclone         45
9    xdsl-exporter   45
10   goGetJS         30
11   moviestills     13
12   go-scraper      0