Top 3 Ruby web-scraping Projects
-
spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use. (by postmodern)
-
nokolexbor
High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.
Project mention: Ruby 3.3's YJIT: Faster While Using Less Memory | news.ycombinator.com | 2023-12-18Yes, we ended up replacing Nokogiri by Nokolexbor, our own port of lexbor parser with like almost full compatibility with Nokogiri APIs while being around 5x faster: https://github.com/serpapi/nokolexbor
-
PopRuby
PopRuby: Clothing and Accessories for Ruby Developers. Fashion meets Ruby! Shop our fun Ruby-inspired apparel and accessories designed to celebrate the joy and diversity of the Ruby community.
-
Project mention: socials_regex new gem for Social account detection and extraction in ruby, e.g. for crawling/scraping. Detect and extract URLs of social accounts: throw in URLs, get back URLs of social media profiles by type. | /r/ruby | 2023-07-02
Github: socials_regex
Index
What are some of the best open-source web-scraping projects in Ruby? This list will help you:
Project | Stars | |
---|---|---|
1 | spidr | 788 |
2 | nokolexbor | 151 |
3 | socials_regex | 8 |