html-parser

Top 23 html-parser Open-Source Projects

  • htmlparser2

    The fast & forgiving HTML and XML parser

  • Project mention: Nue: A React/Vue/Vite/Astro Alternative | news.ycombinator.com | 2023-09-14

    I hear you! I went all-in to jQuery- scene. Even wrote a semi-famous library called "jQuery Tools" (oldies know). Then came React and I wrote Riot to simplify the syntax. Then I sidetracked to a startup world for (too) many years and watched aside how the frontend ecosystem grew to it's current dimensions.

    Node uses a single dependency, htmlparser2 [1], in the package.json [2]. The HTML parser is used to traverse the HTML that is written on the Nue files. I quickly _thought_ of writing my own parser, but right now I'm having my eyes staring at Bun's native HTML parsing capabilities. Instead of Node, I'm using Bun to develop everything. I need less dependencies with it, because things like JS minification or .env file parsing are biult in.

    [1]: https://github.com/fb55/htmlparser2

  • posthtml

    PostHTML is a tool to transform HTML/XML with JS plugins

  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

    SurveyJS logo
  • HtmlAgilityPack

    Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

  • Project mention: Script invoking an Online Port Scan of your external IP, to test your firewall and port forwarder. | /r/PowerShell | 2023-07-06

    Pretty Straighforward. It uses an online port scanner , in this case https://www.speedguide.net/portscan.php parses the replies using HtmlAgilityPack .

  • Kanna

    Kanna(鉋) is an XML/HTML parser for Swift.

  • DiDOM

    Simple and fast HTML and XML parser

  • floki

    Floki is a simple HTML parser that enables search for nodes using CSS selectors.

  • myhtml

    Fast C/C++ HTML 5 Parser. Using threads.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • Atributika

    Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.

  • Oga

    Oga is an XML/HTML parser written in Ruby.

  • Fuzi

    A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

  • A HTML DOM parser written in PHP

    📜 Modern Simple HTML DOM Parser for PHP

  • skrape.it

    A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

  • Project mention: Ksoup - Koltin Multiplatform HTML Parser ⚡ | /r/androiddev | 2023-05-11

    What is wrong with skrape.it?

  • htmlquery

    htmlquery is golang XPath package for HTML query.

  • hickory

    HTML as data (by clj-commons)

  • deno-dom

    Browser DOM & HTML parser in Deno

  • Ksoup

    Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

  • Project mention: Compose Rich Text Editor 0.2.0 released, with a lot of new features | /r/Kotlin | 2023-05-22

    In this version 0.2.0 I added HTML support. Markdown support is coming to 0.3.0. Since the library is multiplatform and there's no Multiplatform HTML or Markdown parsers available, I built my own multiplatform parsing library which is Ksoup and for now it only supports HTML https://github.com/MohamedRejeb/Ksoup

  • ZMarkupParser

    ZMarkupParser is a pure-Swift library that helps you convert HTML strings into NSAttributedString with customized styles and tags.

  • minimize

    Minimize HTML

  • sax-wasm

    The first streamable, fixed memory XML, HTML, and JSX parser for WebAssembly.

  • Aris

    Aris - A fast and powerful tool to write HTML in JS easily. Includes syntax highlighting, templates, SVG, CSS autofixing, debugger support and more...

  • modest_ex

    Elixir library to do pipeable transformations on html strings (with CSS selectors)

  • htmldoc

    A token based HTML Document parser and minifier written in PHP. Extract attribute values and text using CSS selectors.

  • xmlhtml

    XML parser and renderer with HTML 5 quirks mode

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

html-parser related posts

Index

What are some of the best open-source html-parser projects? This list will help you:

Project Stars
1 htmlparser2 4,281
2 posthtml 2,924
3 HtmlAgilityPack 2,550
4 Kanna 2,383
5 DiDOM 2,173
6 floki 1,995
7 myhtml 1,622
8 Atributika 1,351
9 Oga 1,162
10 Fuzi 1,058
11 A HTML DOM parser written in PHP 815
12 skrape.it 752
13 htmlquery 697
14 hickory 620
15 deno-dom 381
16 Ksoup 312
17 ZMarkupParser 264
18 minimize 164
19 sax-wasm 161
20 Aris 88
21 modest_ex 32
22 htmldoc 21
23 xmlhtml 21

Sponsored
Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com