Show HN: Flyscrape – A standalone and scriptable web scraper in Go

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • flyscrape

    Flyscrape is a command-line web scraping tool designed for those without advanced programming skills.

    Thanks for sharing! Just a small nit: the links at the bottom of this page are broken [1].

    [1]: https://github.com/philippta/flyscrape/blob/master/docs/read...

  • GoQuery

    A little like that j-thing, only in Go.

    Your comment was posted 4 minutes ago. That means you still have enough time to edit your comment to change it so it contains real URLs:

    <https://github.com/PuerkitoBio/goquery>

    <https://github.com/dop251/goja>

    (Please do not reply to this comment—I won't be able to delete it once the previous post is fixed if it contains replies.)

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • goja

    ECMAScript/JavaScript engine in pure Go

    Your comment was posted 4 minutes ago. That means you still have enough time to edit your comment to change it so it contains real URLs:

    <https://github.com/PuerkitoBio/goquery>

    <https://github.com/dop251/goja>

    (Please do not reply to this comment—I won't be able to delete it once the previous post is fixed if it contains replies.)

  • Geziyor

    Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

    Its been 8+ years since i started scraping. I even wrote a popular Go web scraping framework previously: (https://github.com/geziyor/geziyor).

    These days, I'm not even using Go for scraping, as the webpage changes makes me crazy, so I moved to Typescript+Playwright. (Crawlee framework is cool, while not strictly necessary).

    My favorite stack as of 2023: TypeScript+Playwright+Crawlee(Optional)

  • colly

    Elegant Scraper and Crawler Framework for Golang

    Interesting. Can you compare it to colly? [0]

    Last time I looked it was the most popular choice for scraping in Go and I have some projects using it.

    Is it similar? Does it have more/less features or is it more suited for a different use case? (Which one?)

    [0] https://github.com/gocolly/colly

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts