Ruby Scraper

Open-source Ruby projects categorized as Scraper

Top 10 Ruby Scraper Projects

  1. Huginn

    Create agents that monitor and act on your behalf. Your agents are standing by!

    Project mention: Show HN: Mashups – Resurrecting Yahoo Pipes, my side project | news.ycombinator.com | 2025-01-06

    Also check out https://nodered.org/ and https://github.com/huginn/huginn if you're interested in free and open-source software you can run yourself.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. Wombat

    Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.

  4. kimuraframework

    Kimurai is a modern web scraping framework written in Ruby which works out of box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests and allows to scrape and interact with JavaScript rendered websites

  5. spidr

    A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use. (by postmodern)

  6. tanakai

    Tanakai is a modern web scraping framework written in Ruby. A fork of Kimurai.

  7. html2rss

    📰 Build RSS 2.0 feeds from websites (and JSON APIs) automatically or with a few CSS selectors.

  8. html2rss-web

    🕸 Generates RSS feeds of any website & serves to the web! Automatic scraping. Ready to use configs. Write your own. Rolling Docker releases for speedy updates.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. nhkore

    🇯🇵📰🗻 NHK News Web (Easy) word frequency (core list) scraper for Japanese language learners.

  11. rails-urltohtml

    A simple rails scrapper app to count html tags of a web page.

  12. chanCrawler

    A simple gem that crawls chans and retrieves visual content

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Ruby Scraper discussion

Log in or Post with

Ruby Scraper related posts

  • Show HN: Mashups – Resurrecting Yahoo Pipes, my side project

    3 projects | news.ycombinator.com | 6 Jan 2025
  • Create agents that monitor and act on your behalf

    1 project | news.ycombinator.com | 24 Mar 2024
  • Tanakai: Modern web scraping framework written in Ruby

    1 project | news.ycombinator.com | 25 Oct 2023
  • Are you using Huginn? If so do you have any latest documentation?

    1 project | /r/selfhosted | 15 Aug 2023
  • Generate RSS feed for any website using CSS selectors

    2 projects | news.ycombinator.com | 14 Jul 2023
  • What web scrapers do you recommend.

    1 project | /r/docker | 5 Jul 2023
  • Any recommendations for a open source replacement for If This Then That?

    2 projects | /r/opensource | 1 Jul 2023
  • A note from our sponsor - CodeRabbit
    coderabbit.ai | 26 Mar 2025
    Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more →

Index

What are some of the best open-source Scraper projects in Ruby? This list will help you:

# Project Stars
1 Huginn 45,240
2 Wombat 1,316
3 kimuraframework 1,015
4 spidr 815
5 tanakai 278
6 html2rss 124
7 html2rss-web 101
8 nhkore 13
9 rails-urltohtml 5
10 chanCrawler 4

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai