HtmlAgilityPack
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files. (by zzzprojects)
PuppeteerSharp
Headless Chrome .NET API (by hardkoded)
Metric | HtmlAgilityPack | PuppeteerSharp
---|---|---
Mentions | 29 | 17
Stars | 2,742 | 3,643
Growth | 0.8% | 1.3%
Activity | 7.3 | 8.1
Last commit | about 1 month ago | 4 days ago
Language | C# | C#
License | MIT License | MIT License
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
HtmlAgilityPack
Posts with mentions or reviews of HtmlAgilityPack.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-08-09.
- Create GitHub profile viewer in web API c#
  HtmlAgilityPack is an HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. More information in Html Agility Pack
- Script invoking an Online Port Scan of your external IP, to test your firewall and port forwarder.
  Pretty straightforward. It uses an online port scanner, in this case https://www.speedguide.net/portscan.php, and parses the replies using HtmlAgilityPack.
- Script to test the state of certain ports on your firewall from the outside
  For your convenience, a function to install HtmlAgilityPack is provided in [Install-HtmlAgilityPack.ps1](Install-HtmlAgilityPack.ps1).
- What do .NET devs use for web scraping these days?
  HtmlAgilityPack
- Copy Pasting Email Content Issue
  I have had success using https://html-agility-pack.net/ to load the HTML and then using the ParsedText function when I need to strip the HTML elements out.
- Content of a web page without markup
  I've used Html Agility Pack for HTML parsing for ... jeez, nearly 20 years now. There are other more modern packages (like AngleSharp, which is indeed quite nice) but Html Agility Pack has treated me well. Their main page has good starter examples.
- Web scraping and reading the string
- APIs for UK supermarkets
  To build on this: with Html Agility Pack.
- PlayStationPlus - List of Games - Website
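Several of the posts above describe the same two tasks: loading HTML into HtmlAgilityPack and querying it with XPath, or stripping the markup to get plain text. The following is a minimal sketch of both, assuming the HtmlAgilityPack NuGet package is referenced. The class name `HapSketch`, its method names, and the sample HTML are made up for illustration and do not come from any of the linked posts.

```csharp
using System;
using System.Collections.Generic;
using HtmlAgilityPack;

// Hypothetical helper class, for illustration only.
public static class HapSketch
{
    // Collects the href value of every <a> element that has one.
    public static List<string> ExtractLinks(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        var links = new List<string>();
        // SelectNodes returns null, not an empty collection, when nothing matches.
        var anchors = doc.DocumentNode.SelectNodes("//a[@href]");
        if (anchors == null)
        {
            return links;
        }

        foreach (var anchor in anchors)
        {
            links.Add(anchor.GetAttributeValue("href", string.Empty));
        }
        return links;
    }

    // Returns the document text with tags removed and HTML entities decoded.
    public static string StripMarkup(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);
        return HtmlEntity.DeEntitize(doc.DocumentNode.InnerText);
    }

    public static void Main()
    {
        // Made-up sample input.
        const string html =
            "<html><body><p>Fish &amp; chips</p>" +
            "<a href=\"https://example.com\">link</a></body></html>";

        foreach (var href in ExtractLinks(html))
        {
            Console.WriteLine(href);
        }
        Console.WriteLine(StripMarkup(html));
    }
}
```

The null check after `SelectNodes` matters because the library signals "no match" with null rather than an empty collection, so iterating the result directly can throw on pages that lack the expected elements.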
PuppeteerSharp
Posts with mentions or reviews of PuppeteerSharp.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-06-13.
- Automate Web Testing in C#: A Guide with PuppeteerSharp and SpecFlow
  We often need to automate tasks or test applications as developers or testers. In such cases, there is usually limited time for manual testing, making PuppeteerSharp an ideal tool for task automation.
- What do .NET devs use for web scraping these days?
  PuppeteerSharp
- (Free) Open-source PDF Generation/Export
  Otherwise, any html to pdf Chromium-based solution hosted via Docker, like [gotenberg](https://github.com/gotenberg/gotenberg) or browserless.io (which is free if you create open source). Generating PDFs from HTML directly in .NET was always a pain. Wkhtml (and wrappers that use it) uses WebKit and comes with a load of issues of its own, similar to running and styling anything in Safari. Using a Chromium-based engine saves a lot of time, as it's the most popular way of doing it these days. You can also use puppeteer-sharp with a local Chromium if you host your app on something that allows it (e.g. not Azure Functions).
- A Handlebar and Puppeteer Equivalent in C#?
- Puppeteer Sharp: Crawl the Web using C# and Headless Chrome
  Puppeteer Sharp is a port of the popular Headless Chrome NodeJS API built by Google. Puppeteer Sharp was written in C# and released in 2017 by Darío Kondratiuk to offer the same functionality to .NET developers.
- Looking for a way to allow users to download a PDF containing client-side rendered graphs
  There is a dotnet package for using headless Chrome: https://www.puppeteersharp.com/. It's a port of the Node.js package of the same name.
- HTMLtoPDF
  For the actual conversion of HTML to PDF we use PuppeteerSharp (https://www.puppeteersharp.com), but you need browserless for that, which requires an API key (don't know the pricing).
- Dynamic html parser (alternative of phantomJS)
  The README on the GitHub site has been good enough for me: https://github.com/hardkoded/puppeteer-sharp
- QuestPDF 2022.01 - a new version of the open-source, C# library for generating complex PDF documents with fluent API, now with complex table-layout support 🎉 Please help me make it popular 🚀
  I've been using https://github.com/hardkoded/puppeteer-sharp for HTML to PDF conversion for quite some time with success (in both .NET Framework and .NET Core).
- Last C# PDF doc/tutorial by Microsoft. Tomorrow, the PDF generation feature will be officially retired. So, I took this opportunity to archive this format. (Up to .NET 6)
  Just generate HTML and use https://github.com/hardkoded/puppeteer-sharp to produce a PDF from headless Chrome.
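Most of the posts above use PuppeteerSharp for one job: rendering HTML to PDF through headless Chrome. A minimal sketch of that workflow follows, assuming a recent PuppeteerSharp NuGet package (one where `BrowserFetcher.DownloadAsync()` takes no arguments and the browser and page types support `await using`) and a host that is allowed to download and launch a local browser. The class name `PdfSketch`, the method name, the sample HTML, and the output path are all made up for illustration.

```csharp
using System.Threading.Tasks;
using PuppeteerSharp;

// Hypothetical helper class, for illustration only.
public static class PdfSketch
{
    // Renders an HTML string to a PDF file using a locally launched headless browser.
    public static async Task HtmlToPdfAsync(string html, string outputPath)
    {
        // Fetches a compatible browser build if one is not already present.
        await new BrowserFetcher().DownloadAsync();

        await using var browser = await Puppeteer.LaunchAsync(
            new LaunchOptions { Headless = true });
        await using var page = await browser.NewPageAsync();

        // Load the markup directly instead of navigating to a URL.
        await page.SetContentAsync(html);
        await page.PdfAsync(outputPath);
    }

    public static async Task Main()
    {
        // Made-up input and output path.
        await HtmlToPdfAsync("<h1>Hello from PuppeteerSharp</h1>", "output.pdf");
    }
}
```

Because the browser runs as a separate process, this approach may not work on restricted hosts, which matches the caveat about Azure Functions raised in one of the posts.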
What are some alternatives?
When comparing HtmlAgilityPack and PuppeteerSharp you can also consider the following projects:
AngleSharp - The ultimate angle brackets parser library parsing HTML5, MathML, SVG and CSS to construct a DOM based on the official W3C specifications.
DinkToPdf - C# .NET Core wrapper for wkhtmltopdf library that uses Webkit engine to convert HTML pages to PDF.
ExCSS - A CSS Parser for .NET. It's BADA55!
FlaUI - UI automation library for .NET
CsQuery - CsQuery is a complete CSS selector engine, HTML parser, and jQuery port for C# and .NET 4.
CefSharp - .NET (WPF and Windows Forms) bindings for the Chromium Embedded Framework