Top 17 html-parser Open-Source Projects
-
-
-
SonarQube
Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
-
HtmlAgilityPack
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
You can use Html Agility Pack library to manage your HTML document string.
-
-
-
Project mention: Libraries for retrivieng html data from website | reddit.com/r/cpp_questions | 2021-10-09
if you want to parse it, there is only https://github.com/lexborisov/myhtml
-
In terms of parsing style I suggest sticking with a hand-written recursive-descent parser. Parser generators seem appealing at first, but I always ran into annoying limitations when using them (I wrote one in Ruby myself as well, and used this for this project). Parsing combinators are useful for small inputs, but I find them difficult to use for anything but simple cases.
-
Scout APM
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
-
-
-
skrape.it
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
-
htmlquery
-
This file uses deno_dom (https://github.com/b-fuze/deno-dom) to catalogue and find links.
-
-
-
-
-
htmldoc
A token based HTML Document parser and minifier written in PHP. Extract attribute values and text using CSS selectors.
First let me fess up here, I have written a number of PHP minification programs in my career, and 3 years ago I started my own GitHub project to write an HTML minifier in PHP. Partly this article is to present that software to you, but it is also to show you the research I did myself to see how my project stacked up against the competition and improve my software, hopefully you will find the results interesting, as I did.
html-parser related posts
Index
What are some of the best open-source html-parser projects? This list will help you:
Project | Stars | |
---|---|---|
1 | posthtml | 2,786 |
2 | Kanna | 2,260 |
3 | HtmlAgilityPack | 2,114 |
4 | DiDOM | 1,966 |
5 | floki | 1,712 |
6 | myhtml | 1,532 |
7 | Oga | 1,151 |
8 | Fuzi | 963 |
9 | A HTML DOM parser written in PHP | 560 |
10 | skrape.it | 515 |
11 | htmlquery | 477 |
12 | deno-dom | 234 |
13 | minimize | 156 |
14 | sax-wasm | 125 |
15 | modest_ex | 32 |
16 | xmlhtml | 21 |
17 | htmldoc | 12 |
Are you hiring? Post a new remote job listing for free.