PuppeteerSharp
playwright-sharp
Metric | PuppeteerSharp | playwright-sharp |
---|---|---|
Mentions | 16 | 4 |
Stars | 3,162 | 840 |
Growth | 3.3% | - |
Activity | 9.3 | 9.5 |
Last commit | 4 days ago | almost 3 years ago |
Language | C# | C# |
License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PuppeteerSharp
-
What do .NET devs use for web scraping these days?
PuppeteerSharp
-
(Free) Open-source PDF Generation/Export
Otherwise, any Chromium-based HTML-to-PDF solution hosted via Docker, like gotenberg (https://github.com/gotenberg/gotenberg) or browserless.io (which is free if you create open source). Generating PDFs from HTML directly in .NET was always a pain. Wkhtmltopdf (and wrappers that use it) uses WebKit and comes with a load of issues of its own, similar to running and styling anything in Safari. Using a Chromium-based engine saves a lot of time, as it's the most popular way of doing it these days. You can also use puppeteer-sharp with a local Chromium if you host your app on something that allows it (e.g. not Azure Functions).
-
A Handlebar and Puppeteer Equivalent in C#?
-
Puppeteer Sharp: Crawl the Web using C# and Headless Chrome
Puppeteer Sharp is a port of the popular Headless Chrome NodeJS API built by Google. Puppeteer Sharp was written in C# and released in 2017 by Darío Kondratiuk to offer the same functionality to .NET developers.
-
Looking for a way to allow users to download a PDF containing client-side rendered graphs
There is a dotnet package for using headless Chrome: https://www.puppeteersharp.com/. It's a port of the Node.js package of the same name.
-
HTMLtoPDF
For the actual conversion of HTML to PDF we use PuppeteerSharp (https://www.puppeteersharp.com), but you need browserless for that, which requires an API key (don't know the pricing).
-
Dynamic html parser (alternative of phantomJS)
The README on the GitHub site has been good enough for me: https://github.com/hardkoded/puppeteer-sharp
-
QuestPDF 2022.01 - a new version of the open-source, C# library for generating complex PDF documents with fluent API, now with complex table-layout support 🎉 Please help me make it popular 🚀
I've been using https://github.com/hardkoded/puppeteer-sharp for HTML to PDF conversion for quite some time with success (in both .NET Framework and .NET Core).
-
Last C# PDF doc/tutorial by Microsoft. Tomorrow, the PDF generation feature will be officially retired. So, I took this opportunity to archive this format. (Up to .NET 6)
Just generate HTML and use https://github.com/hardkoded/puppeteer-sharp to produce a PDF from headless Chrome.
-
Best PDF generation library to create nested tables
Another idea is to generate the document as HTML first and then convert the HTML to a PDF (with something like PuppeteerSharp). I have found this more flexible in the past, but I have not personally done this with nested tables.
playwright-sharp
-
.NET Core ASP - PDF Generation for hosting on Azure?
Or https://github.com/microsoft/playwright-sharp
-
WebBrowser class
There are multiple ways to do this. For web scraping like you describe, I would use Playwright-sharp.
-
Can I automatize navigating websites with C#?
I've used PuppeteerSharp for web automation with great success. I understand that project is now Playwright-Sharp. I would highly recommend that over something like Selenium which is really more for testing.
-
Durable Functions to Schedule Publish to Dev.To
Using either Puppeteer or Playwright to scrape a web page is super simple. Both have their own .NET ports, Puppeteer Sharp and Playwright Sharp respectively. Unfortunately, they don't work on Azure Functions. More precisely, they work in your local dev environment, but not on an Azure instance. This post would be useful for your Node.js Azure Functions app, but it's not that helpful for your .NET application. Let me find a way for it to work correctly on an Azure Functions instance.
What are some alternatives?
CefSharp - .NET (WPF and Windows Forms) bindings for the Chromium Embedded Framework
Playwright - Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
DinkToPdf - C# .NET Core wrapper for wkhtmltopdf library that uses Webkit engine to convert HTML pages to PDF.
YamlDotNet - YamlDotNet is a .NET library for YAML
playwright-dotnet - .NET version of the Playwright testing and automation library.
qttabbar - QTTabBar is a small tool that allows you to use tab multi label function in Windows Explorer. https://www.yuque.com/indiff/qttabbar
ASP.NET Core - ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.
WebDriverManager.Net - Automatic Selenium Webdriver binaries management for .Net
HtmlAgilityPack - Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
AngleSharp - The ultimate angle brackets parser library parsing HTML5, MathML, SVG and CSS to construct a DOM based on the official W3C specifications.
FlaUI - UI automation library for .Net