Ruby web-scraping

Open-source Ruby projects categorized as web-scraping

Top 3 Ruby web-scraping Projects

  • spidr

    A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use. (by postmodern)

  • nokolexbor

    High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.

    Project mention: Ruby 3.3's YJIT: Faster While Using Less Memory | news.ycombinator.com | 2023-12-18

    Yes, we ended up replacing Nokogiri by Nokolexbor, our own port of lexbor parser with like almost full compatibility with Nokogiri APIs while being around 5x faster: https://github.com/serpapi/nokolexbor

  • PopRuby

    PopRuby: Clothing and Accessories for Ruby Developers. Fashion meets Ruby! Shop our fun Ruby-inspired apparel and accessories designed to celebrate the joy and diversity of the Ruby community.

  • socials_regex

    🪡 Social account detection and extraction in ruby, e.g. for crawling/scraping.

    Project mention: socials_regex new gem for Social account detection and extraction in ruby, e.g. for crawling/scraping. Detect and extract URLs of social accounts: throw in URLs, get back URLs of social media profiles by type. | /r/ruby | 2023-07-02

    Github: socials_regex

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-12-18.

Index

What are some of the best open-source web-scraping projects in Ruby? This list will help you:

Project Stars
1 spidr 788
2 nokolexbor 151
3 socials_regex 8
The modern identity platform for B2B SaaS
The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
workos.com