InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 5 Ruby HTML/XML Parsing Projects
-
You may also be interested in https://github.com/gjtorikian/html-pipeline (or its main dependency, https://github.com/gjtorikian/selma), for high performance HTML manipulation.
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
nokolexbor
High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.
Project mention: Nokolexbor: Drop-in replacement for Nokogiri. 5.2x faster at parsing HTML | news.ycombinator.com | 2024-11-17 -
ROXML
ROXML is a module for binding Ruby classes to XML. It supports custom mapping and bidirectional marshalling between Ruby and XML using annotation-style class methods, via Nokogiri or LibXML.
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Ruby HTML/XML Parsing discussion
Ruby HTML/XML Parsing related posts
-
How we made a Ruby method 200x faster
-
Ruby 3.3's YJIT: Faster While Using Less Memory
-
Did you know Nokogiri now has opt-in HTML5 parsing?
-
Nokolexbor: A drop-in replacement for Nokogiri Up to 5.2x faster at parsing HTML
-
What should I be learning?
-
My favorite Ruby gems
-
Caching All Native Ruby Gem Platforms
-
A note from our sponsor - InfluxDB
www.influxdata.com | 13 Jun 2025
Index
What are some of the best open-source HTML/XML Parsing projects in Ruby? This list will help you:
# | Project | Stars |
---|---|---|
1 | HTML::Pipeline | 2,277 |
2 | Oga | 1,168 |
3 | nokolexbor | 344 |
4 | ROXML | 223 |
5 | HappyMapper | 151 |