The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 html-parser Open-Source Projects
-
SurveyJS
Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
-
HtmlAgilityPack
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Atributika
Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.
-
skrape.it
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
-
Ksoup
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.
-
ZMarkupParser
ZMarkupParser is a pure-Swift library that helps you convert HTML strings into NSAttributedString with customized styles and tags.
-
Aris
Aris - A fast and powerful tool to write HTML in JS easily. Includes syntax highlighting, templates, SVG, CSS autofixing, debugger support and more...
-
htmldoc
A token based HTML Document parser and minifier written in PHP. Extract attribute values and text using CSS selectors.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I hear you! I went all-in to jQuery- scene. Even wrote a semi-famous library called "jQuery Tools" (oldies know). Then came React and I wrote Riot to simplify the syntax. Then I sidetracked to a startup world for (too) many years and watched aside how the frontend ecosystem grew to it's current dimensions.
Node uses a single dependency, htmlparser2 [1], in the package.json [2]. The HTML parser is used to traverse the HTML that is written on the Nue files. I quickly _thought_ of writing my own parser, but right now I'm having my eyes staring at Bun's native HTML parsing capabilities. Instead of Node, I'm using Bun to develop everything. I need less dependencies with it, because things like JS minification or .env file parsing are biult in.
[1]: https://github.com/fb55/htmlparser2
Project mention: Script invoking an Online Port Scan of your external IP, to test your firewall and port forwarder. | /r/PowerShell | 2023-07-06Pretty Straighforward. It uses an online port scanner , in this case https://www.speedguide.net/portscan.php parses the replies using HtmlAgilityPack .
What is wrong with skrape.it?
Project mention: Compose Rich Text Editor 0.2.0 released, with a lot of new features | /r/Kotlin | 2023-05-22In this version 0.2.0 I added HTML support. Markdown support is coming to 0.3.0. Since the library is multiplatform and there's no Multiplatform HTML or Markdown parsers available, I built my own multiplatform parsing library which is Ksoup and for now it only supports HTML https://github.com/MohamedRejeb/Ksoup
html-parser related posts
- Google Local Results AI Parser
- Ruby gem to parse structured data from Google Local Search Results
- Show HN: A Ruby Gem to Perform AI Powered Parsing for Google Local Results
- Ksoup - Koltin Multiplatform HTML Parser ⚡
- Web Scraping with PHP: Step-By-Step Tutorial
- Dumb idea for testing output of a static site generator: use on-page DOM inspection instead of playwright?
- How to learn how to make a Web Scraper in C++?
-
A note from our sponsor - WorkOS
workos.com | 25 Apr 2024
Index
What are some of the best open-source html-parser projects? This list will help you:
Project | Stars | |
---|---|---|
1 | htmlparser2 | 4,281 |
2 | posthtml | 2,924 |
3 | HtmlAgilityPack | 2,550 |
4 | Kanna | 2,383 |
5 | DiDOM | 2,173 |
6 | floki | 1,995 |
7 | myhtml | 1,622 |
8 | Atributika | 1,351 |
9 | Oga | 1,162 |
10 | Fuzi | 1,058 |
11 | A HTML DOM parser written in PHP | 815 |
12 | skrape.it | 752 |
13 | htmlquery | 697 |
14 | hickory | 620 |
15 | deno-dom | 381 |
16 | Ksoup | 312 |
17 | ZMarkupParser | 264 |
18 | minimize | 164 |
19 | sax-wasm | 161 |
20 | Aris | 88 |
21 | modest_ex | 32 |
22 | htmldoc | 21 |
23 | xmlhtml | 21 |
Sponsored