HtmlAgilityPack
playwright-dotnet
Our great sponsors
HtmlAgilityPack | playwright-dotnet | |
---|---|---|
28 | 17 | |
2,553 | 2,290 | |
1.5% | 3.0% | |
7.6 | 8.8 | |
2 days ago | 7 days ago | |
C# | C# | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
HtmlAgilityPack
-
Script invoking an Online Port Scan of your external IP, to test your firewall and port forwarder.
Pretty Straighforward. It uses an online port scanner , in this case https://www.speedguide.net/portscan.php parses the replies using HtmlAgilityPack .
-
Script to test the state of certain ports on your firewall from the outside
For your convienience, a function to install HtmlAgilityPack is provided in [Install-HtmlAgilityPack.ps1](Install-HtmlAgilityPack.ps1).
-
What do .NET devs use for web scraping these days?
HTMLAgilityPack
-
Copy Pasting Email Content Issue
I have had success with using https://html-agility-pack.net/ to load the html and then use the ParsedText function when i am in need of stripping the html elements out.
-
Content of a web page without markup
I've used Html Agility Pack for html parsing for ... jeez, nearly 20 years now. There are other more modern packages (like AngleSharp which is indeed quite nice) but Html Agility Pack has treated me well. Their main page has good starter examples.
- Web scraping and reading the string
-
APIs for UK supermarkets
To build on this; with HTML agility pack.
- PlayStationPlus - List of Games - Website
-
What is the best tool to scrape this table intensive web page via Table Name, Column Name and Row Name. Turn the page into an SQL table ?
One way you could do this is with .NET (i.e. C#), HTML Agility Pack, and some SQL library like SQLite (depending on how you want to store tables).
playwright-dotnet
-
What do .NET devs use for web scraping these days?
Playwright
-
How do control a browser?
Check out Playwright which is developed by Microsoft. It's like Selenium, but newer and maybe better. You can use it outside unit tests as a regular library.
-
Recommendations for open source projects to work on?
SDET? Check out Playwright's .NET connector: https://github.com/microsoft/playwright-dotnet
-
What is the best way to simulate "open an web browser application and login" without having to open 400 browsers to get 400 users online?
I'd suggest looking into https://github.com/microsoft/playwright-dotnet, it has been so much nicer to work with than Selenium.
-
How to create a bot to do stuff on the internet I'm too lazy to do.
I believe playwright can do the job, https://playwright.dev/dotnet/
-
Razor Templates Examples including PDF Reports
Then you'd take that output and generate a PDF with Chrome: https://playwright.dev/dotnet/
-
QuestPDF 2021.10 - a new version of the open-source, MIT-licensed, C# library for generating PDF documents with fluent API, now with extended text capabilities. Please help me make it popular :)
If you've already got ASP.NET generating pages then you should try use this to save pages as PDF: https://github.com/microsoft/playwright-dotnet
-
Web Scrapping with F#
If you've ever wanted to pull data periodically from a website, or you are a QA automation person looking to do E2E (end to end) testing, then Playwright might be an option for you. Similar to Cypress or PhantomJS, Playwright is a library that allows you to automate ineractions with websites, you can even take screenshots and PDFs!
- Web Scrapping con F#
What are some alternatives?
AngleSharp - :angel: The ultimate angle brackets parser library parsing HTML5, MathML, SVG and CSS to construct a DOM based on the official W3C specifications.
PuppeteerSharp - Headless Chrome .NET API
CsQuery - CsQuery is a complete CSS selector engine, HTML parser, and jQuery port for C# and .NET 4.
playwright-sharp - .NET version of the Playwright testing and automation library. [Moved to: https://github.com/microsoft/playwright-sharp]
playwright-python - Python version of the Playwright testing and automation library.
FluentBootstrap
QuestPDF - QuestPDF is a modern open-source .NET library for PDF document generation. Offering comprehensive layout engine powered by concise and discoverable C# Fluent API. Easily generate PDF reports, invoices, exports, etc.
Markdig - A fast, powerful, CommonMark compliant, extensible Markdown processor for .NET
puppeteer - Node.js API for Chrome
dotless - .NET Port of the ruby Less CSS lib
robotframework-browser - Robot Framework Browser library powered by Playwright.