AngleSharp
PuppeteerSharp
AngleSharp | PuppeteerSharp | |
---|---|---|
15 | 16 | |
5,001 | 3,168 | |
0.9% | 1.8% | |
8.6 | 9.3 | |
19 days ago | 3 days ago | |
C# | C# | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
AngleSharp
-
Migration of a Dynamic Website to a Static Website
Instead, what I ended up doing is utilizing AngleSharp for transforming the existing (dynamic) websites into static files. I've stored them on disk and made them ready to be served statically.
-
What do .NET devs use for web scraping these days?
AngleSharp
-
School Web scraping
Read and understand the HTML DOM with something like this https://anglesharp.github.io/
- Text editor library/HTML parser options for blogging app
-
Powershell/HTML: Apply color to HTML table based on results
Personally, if that script I wrote did need to be used on Core I probably would have compiled AngleSharp, loaded the HTML Parser DLL into the script, and used that.
-
C# Web Scraper to check a sent Form
Now, assuming you have to scrape, C# does have two decent scraping libraries: AngleSharp and Html Agility Pack. I personally prefer AngleSharp.
-
Tell me your common tasks!
I would love a blog post or even a series of them about scraping local HTML files with either AngleSharp or HTMLAgilityPack.
-
How can I grab onto the html class name of a string that contains html, using c#?
If you want a cleaner and modern alternative to HtmlAgilityPack then I'd suggest https://github.com/AngleSharp/AngleSharp
- How to parse HTML in .NET - C# libraries comparison and benchmarks in 2021
-
Web scraping is so complicated, i cant specify a class only an attribute
Use anglesharp for parsing html
PuppeteerSharp
-
What do .NET devs use for web scraping these days?
PuppeteerSharp
-
(Free) Open-source PDF Generation/Export
Otherwise, any pdf to hml chromium based solution hosted via docker, like gotenberg](https://github.com/gotenberg/gotenberg) or browserless.io(which is free if you create open source). Generating pdfs from html directly in .NET was always a pain. Wkhtml (and wrappers that use it) uses WebKit and comes with a load of issues of its own, similar to running and styling anything in Safari. Using chromium based engine saves a lot of time as it's the most popular way of doing it these days. You can also use puppeteer-sharp with local chromium if you host your app on something that allows it(f.e. not Azure Functions).
- A Handlebar and Puppeteer Equivalent in C#?
-
Puppeteer Sharp: Crawl the Web using C# and Headless Chrome
Puppeteer Sharp is a port of the popular Headless Chrome NodeJS API built by Google. Puppeteer Sharp was written in C# and released in 2017 by Darío Kondratiuk to offer the same functionality to .NET developers.
-
Looking for a way to allow users to download a PDF containing client-side rendered graphs
there is a dotnet package for using headless chrome: https://www.puppeteersharp.com/ its a port of the nodejs package of the same name
-
HTMLtoPDF
For the actual conversion of html to pdf we use PuppeteerSharp (https://www.puppeteersharp.com ) but you need browserless for that, which requires an api key (don’t know the pricing)
-
Dynamic html parser (alternative of phantomJS)
The README in the GitHub site has been good enough for me: https://github.com/hardkoded/puppeteer-sharp
-
QuestPDF 2022.01 - a new version of the open-source, C# library for generating complex PDF documents with fluent API, now with complex table-layout support 🎉 Please help me make it popular 🚀
Iv'e been using https://github.com/hardkoded/puppeteer-sharp for html to pdf conversion for quite some time with success. (in both .net framework and .net core)
-
Last C# PDF doc/tutorial by Microsoft. Tomorrow, the PDF generation feature will be officially retired. So, I took this opportunity to archive this format. (Up to .NET 6)
Just generate HTML and use https://github.com/hardkoded/puppeteer-sharp to produce a PDF from headless Chrome.
-
Best PDF generation library to create nested tables
Another idea is to maybe generate the document as HTML first and then convert the HTML to a PDF (with somehting like PuppeteerSharp. I have found this more flexible in the past, but I have not personally done with with nested tables.
What are some alternatives?
HtmlAgilityPack - Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
CefSharp - .NET (WPF and Windows Forms) bindings for the Chromium Embedded Framework
CsQuery - CsQuery is a complete CSS selector engine, HTML parser, and jQuery port for C# and .NET 4.
DinkToPdf - C# .NET Core wrapper for wkhtmltopdf library that uses Webkit engine to convert HTML pages to PDF.
dotless - .NET Port of the ruby Less CSS lib
playwright-dotnet - .NET version of the Playwright testing and automation library.
FluentBootstrap
ASP.NET Core - ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.
ExCSS - A CSS Parser for .NET. It's BADA55!
Playwright - Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
FlaUI - UI automation library for .Net