Nokolexbor: Drop-in replacement for Nokogiri. 5.2x faster at parsing HTML

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. nokolexbor

    High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. selectolax

    Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).

    Lexbor can also be used from Python: https://github.com/rushter/selectolax

  4. Nokogiri

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby.

    It seems to have an in-tree libxml 2.11 for XPath support, which was released in 2023-04. Almost every second libxml release comes with a CVE, so I'm curious if there's plans to upgrade the libxml version, since it doesn't use the system libxml (same as nokogiri).

    One of the reasons I still use nokogiri is because it puts a lot of effort into keeping libxml updated: https://github.com/sparklemotion/nokogiri/releases

  5. selma

    Selma selects and matches HTML nodes using CSS rules. Backed by Rust's lol_html parser. (by gjtorikian)

    You may also be interested in https://github.com/gjtorikian/selma for high performance HTML manipulation. It’s built on Rust—Cloudflare’s lol_html parser to be precise.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Lexbor – an open source HTML Renderer library

    6 projects | news.ycombinator.com | 11 Jun 2024
  • Nokolexbor: A drop-in replacement for Nokogiri Up to 5.2x faster at parsing HTML

    1 project | news.ycombinator.com | 6 Apr 2023
  • How we made a Ruby method 200x faster

    4 projects | news.ycombinator.com | 17 Nov 2024
  • Did you know Nokogiri now has opt-in HTML5 parsing?

    7 projects | /r/ruby | 5 Jun 2023
  • What should I be learning?

    3 projects | /r/ruby | 25 Oct 2022

Did you know that C is
the 6th most popular programming language
based on number of references?