PuppeteerSharp
HtmlAgilityPack
Our great sponsors
PuppeteerSharp | HtmlAgilityPack | |
---|---|---|
16 | 28 | |
3,162 | 2,550 | |
3.3% | 1.4% | |
9.3 | 7.5 | |
3 days ago | about 1 month ago | |
C# | C# | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PuppeteerSharp
-
What do .NET devs use for web scraping these days?
PuppeteerSharp
-
(Free) Open-source PDF Generation/Export
Otherwise, any pdf to hml chromium based solution hosted via docker, like gotenberg](https://github.com/gotenberg/gotenberg) or browserless.io(which is free if you create open source). Generating pdfs from html directly in .NET was always a pain. Wkhtml (and wrappers that use it) uses WebKit and comes with a load of issues of its own, similar to running and styling anything in Safari. Using chromium based engine saves a lot of time as it's the most popular way of doing it these days. You can also use puppeteer-sharp with local chromium if you host your app on something that allows it(f.e. not Azure Functions).
- A Handlebar and Puppeteer Equivalent in C#?
-
Puppeteer Sharp: Crawl the Web using C# and Headless Chrome
Puppeteer Sharp is a port of the popular Headless Chrome NodeJS API built by Google. Puppeteer Sharp was written in C# and released in 2017 by Darío Kondratiuk to offer the same functionality to .NET developers.
-
Looking for a way to allow users to download a PDF containing client-side rendered graphs
there is a dotnet package for using headless chrome: https://www.puppeteersharp.com/ its a port of the nodejs package of the same name
-
HTMLtoPDF
For the actual conversion of html to pdf we use PuppeteerSharp (https://www.puppeteersharp.com ) but you need browserless for that, which requires an api key (don’t know the pricing)
-
Dynamic html parser (alternative of phantomJS)
The README in the GitHub site has been good enough for me: https://github.com/hardkoded/puppeteer-sharp
-
QuestPDF 2022.01 - a new version of the open-source, C# library for generating complex PDF documents with fluent API, now with complex table-layout support 🎉 Please help me make it popular 🚀
Iv'e been using https://github.com/hardkoded/puppeteer-sharp for html to pdf conversion for quite some time with success. (in both .net framework and .net core)
-
Last C# PDF doc/tutorial by Microsoft. Tomorrow, the PDF generation feature will be officially retired. So, I took this opportunity to archive this format. (Up to .NET 6)
Just generate HTML and use https://github.com/hardkoded/puppeteer-sharp to produce a PDF from headless Chrome.
-
Best PDF generation library to create nested tables
Another idea is to maybe generate the document as HTML first and then convert the HTML to a PDF (with somehting like PuppeteerSharp. I have found this more flexible in the past, but I have not personally done with with nested tables.
HtmlAgilityPack
-
Script invoking an Online Port Scan of your external IP, to test your firewall and port forwarder.
Pretty Straighforward. It uses an online port scanner , in this case https://www.speedguide.net/portscan.php parses the replies using HtmlAgilityPack .
-
Script to test the state of certain ports on your firewall from the outside
For your convienience, a function to install HtmlAgilityPack is provided in [Install-HtmlAgilityPack.ps1](Install-HtmlAgilityPack.ps1).
-
What do .NET devs use for web scraping these days?
HTMLAgilityPack
-
Copy Pasting Email Content Issue
I have had success with using https://html-agility-pack.net/ to load the html and then use the ParsedText function when i am in need of stripping the html elements out.
-
Content of a web page without markup
I've used Html Agility Pack for html parsing for ... jeez, nearly 20 years now. There are other more modern packages (like AngleSharp which is indeed quite nice) but Html Agility Pack has treated me well. Their main page has good starter examples.
- Web scraping and reading the string
-
APIs for UK supermarkets
To build on this; with HTML agility pack.
- PlayStationPlus - List of Games - Website
-
What is the best tool to scrape this table intensive web page via Table Name, Column Name and Row Name. Turn the page into an SQL table ?
One way you could do this is with .NET (i.e. C#), HTML Agility Pack, and some SQL library like SQLite (depending on how you want to store tables).
What are some alternatives?
CefSharp - .NET (WPF and Windows Forms) bindings for the Chromium Embedded Framework
AngleSharp - :angel: The ultimate angle brackets parser library parsing HTML5, MathML, SVG and CSS to construct a DOM based on the official W3C specifications.
DinkToPdf - C# .NET Core wrapper for wkhtmltopdf library that uses Webkit engine to convert HTML pages to PDF.
CsQuery - CsQuery is a complete CSS selector engine, HTML parser, and jQuery port for C# and .NET 4.
playwright-dotnet - .NET version of the Playwright testing and automation library.
FluentBootstrap
ASP.NET Core - ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.
Markdig - A fast, powerful, CommonMark compliant, extensible Markdown processor for .NET
FlaUI - UI automation library for .Net
dotless - .NET Port of the ruby Less CSS lib
playwright-sharp - .NET version of the Playwright testing and automation library. [Moved to: https://github.com/microsoft/playwright-dotnet]
ExCSS - A CSS Parser for .NET. It's BADA55!