AngleSharp
playwright-dotnet
Our great sponsors
AngleSharp | playwright-dotnet | |
---|---|---|
15 | 17 | |
4,999 | 2,284 | |
1.5% | 2.8% | |
8.6 | 8.9 | |
11 days ago | 7 days ago | |
C# | C# | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
AngleSharp
-
Migration of a Dynamic Website to a Static Website
Instead, what I ended up doing is utilizing AngleSharp for transforming the existing (dynamic) websites into static files. I've stored them on disk and made them ready to be served statically.
-
What do .NET devs use for web scraping these days?
AngleSharp
-
School Web scraping
Read and understand the HTML DOM with something like this https://anglesharp.github.io/
- Text editor library/HTML parser options for blogging app
-
Powershell/HTML: Apply color to HTML table based on results
Personally, if that script I wrote did need to be used on Core I probably would have compiled AngleSharp, loaded the HTML Parser DLL into the script, and used that.
-
C# Web Scraper to check a sent Form
Now, assuming you have to scrape, C# does have two decent scraping libraries: AngleSharp and Html Agility Pack. I personally prefer AngleSharp.
-
Tell me your common tasks!
I would love a blog post or even a series of them about scraping local HTML files with either AngleSharp or HTMLAgilityPack.
-
How can I grab onto the html class name of a string that contains html, using c#?
If you want a cleaner and modern alternative to HtmlAgilityPack then I'd suggest https://github.com/AngleSharp/AngleSharp
- How to parse HTML in .NET - C# libraries comparison and benchmarks in 2021
-
Web scraping is so complicated, i cant specify a class only an attribute
Use anglesharp for parsing html
playwright-dotnet
-
What do .NET devs use for web scraping these days?
Playwright
-
How do control a browser?
Check out Playwright which is developed by Microsoft. It's like Selenium, but newer and maybe better. You can use it outside unit tests as a regular library.
-
Recommendations for open source projects to work on?
SDET? Check out Playwright's .NET connector: https://github.com/microsoft/playwright-dotnet
-
What is the best way to simulate "open an web browser application and login" without having to open 400 browsers to get 400 users online?
I'd suggest looking into https://github.com/microsoft/playwright-dotnet, it has been so much nicer to work with than Selenium.
-
How to create a bot to do stuff on the internet I'm too lazy to do.
I believe playwright can do the job, https://playwright.dev/dotnet/
-
Razor Templates Examples including PDF Reports
Then you'd take that output and generate a PDF with Chrome: https://playwright.dev/dotnet/
-
QuestPDF 2021.10 - a new version of the open-source, MIT-licensed, C# library for generating PDF documents with fluent API, now with extended text capabilities. Please help me make it popular :)
If you've already got ASP.NET generating pages then you should try use this to save pages as PDF: https://github.com/microsoft/playwright-dotnet
-
Web Scrapping with F#
If you've ever wanted to pull data periodically from a website, or you are a QA automation person looking to do E2E (end to end) testing, then Playwright might be an option for you. Similar to Cypress or PhantomJS, Playwright is a library that allows you to automate ineractions with websites, you can even take screenshots and PDFs!
- Web Scrapping con F#
What are some alternatives?
HtmlAgilityPack - Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
PuppeteerSharp - Headless Chrome .NET API
CsQuery - CsQuery is a complete CSS selector engine, HTML parser, and jQuery port for C# and .NET 4.
playwright-sharp - .NET version of the Playwright testing and automation library. [Moved to: https://github.com/microsoft/playwright-sharp]
dotless - .NET Port of the ruby Less CSS lib
playwright-python - Python version of the Playwright testing and automation library.
FluentBootstrap
QuestPDF - QuestPDF is a modern open-source .NET library for PDF document generation. Offering comprehensive layout engine powered by concise and discoverable C# Fluent API. Easily generate PDF reports, invoices, exports, etc.
ExCSS - A CSS Parser for .NET. It's BADA55!
puppeteer - Node.js API for Chrome
Playwright - Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
robotframework-browser - Robot Framework Browser library powered by Playwright.