Spidey
abot
Metric | Spidey | abot |
---|---|---|
Mentions | 2 | 1 |
Stars | 11 | 2,204 |
Growth | - | - |
Activity | 9.5 | 0.0 |
Last commit | 11 days ago | 11 months ago |
Language | C# | C# |
License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Spidey
- I need data from a website. Is it viable to create an API that scrapes the website and returns the data on an endpoint?
I didn't get a chance to reply earlier, but depending on what you're trying to do, you might want a web crawler. I have a crawler on GitHub that I built for scraping in cases where the site doesn't offer an API. If you go this route, I suggest running the scrape as a background task and serving requests from cached data.
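The background-task-plus-cache suggestion could be sketched roughly as follows. This is a minimal illustration, not code from Spidey: `CachedScrape`, `scrape`, and `interval_seconds` are hypothetical names, and Python is used only for brevity. The idea is that the endpoint only ever reads the cache, while a separate thread does the slow scraping.

```python
import threading
from typing import Callable, Optional


class CachedScrape:
    """Holds the latest scrape result; a background thread refreshes it periodically."""

    def __init__(self, scrape: Callable[[], dict], interval_seconds: float):
        self._scrape = scrape              # hypothetical function that does the real scraping
        self._interval = interval_seconds
        self._lock = threading.Lock()
        self._data: Optional[dict] = None
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def start(self) -> None:
        self.refresh()                     # fill the cache once before serving
        self._thread.start()

    def stop(self) -> None:
        self._stop.set()
        if self._thread.is_alive():
            self._thread.join()

    def refresh(self) -> None:
        data = self._scrape()              # slow work happens outside the lock
        with self._lock:
            self._data = data

    def get(self) -> Optional[dict]:
        """What an API endpoint would return: the cached data, never a live scrape."""
        with self._lock:
            return self._data

    def _run(self) -> None:
        # Event.wait returns False on timeout and True once stop() has been called.
        while not self._stop.wait(self._interval):
            try:
                self.refresh()
            except Exception:
                pass                       # keep serving the last good data if a scrape fails
```

An endpoint handler would call `get()` and return immediately, so a slow or failing target site never blocks a request.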
- Recursion needed in small crawler
This may be overkill, but I have a library out there for building web crawlers: Spidey. I'm not suggesting you use it, but you could look at it for ideas. It uses a multithreaded producer/consumer approach that avoids recursion and stack overflow issues: use a queue, pull each URL from the queue, and push new URLs onto it as you find them. I still need to optimize my code a bit more, but it may help. That said, your issue is most likely that you're finding a link to the page you are currently on. Keeping a HashSet or List of found URLs would solve that.
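The queue-plus-seen-set idea described above can be sketched like this. It is a single-threaded illustration only and does not reproduce Spidey's multithreaded implementation; `crawl`, `get_links`, `max_pages`, and the `SITE` dictionary are hypothetical stand-ins for real fetching and HTML parsing. Python is used for brevity; the `deque` and `set` here correspond to `Queue<T>` and `HashSet<T>` in C#.

```python
from __future__ import annotations

from collections import deque
from typing import Callable, Iterable


def crawl(start_url: str,
          get_links: Callable[[str], Iterable[str]],
          max_pages: int = 1000) -> list[str]:
    """Visit pages breadth-first using an explicit queue instead of recursion."""
    seen = {start_url}           # every URL ever queued, so self-links and cycles are skipped
    queue = deque([start_url])   # URLs waiting to be processed
    visited = []                 # order in which pages were processed

    while queue and len(visited) < max_pages:
        url = queue.popleft()
        visited.append(url)
        for link in get_links(url):
            if link not in seen:     # the check that prevents revisiting the current page
                seen.add(link)
                queue.append(link)
    return visited


# Hypothetical in-memory "site": each page maps to the links found on it.
SITE = {
    "/": ["/", "/a", "/b"],      # the home page links to itself
    "/a": ["/b", "/"],
    "/b": ["/a", "/c"],
    "/c": [],
}

if __name__ == "__main__":
    print(crawl("/", lambda url: SITE.get(url, [])))   # ['/', '/a', '/b', '/c']
```

Because the pending work lives in a queue on the heap rather than in nested calls, the depth of the site cannot overflow the stack, and the seen set ensures a page that links to itself is processed only once.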
abot
- Can you build a web crawler in C#?
This can be done perfectly well in C#; see https://github.com/sjdirect/abot for example.
What are some alternatives?
scrapyrt - HTTP API for Scrapy spiders
Modern.Forms - Cross-platform spiritual successor to Winforms for .NET 6+
scrapy-sanoma-kuntavaalit2021 - Fetch Sanoma kuntavaalit 2021 data [Moved to: https://github.com/raspi/scrapy-kuntavaalit2021-sanoma]
PdfSharpCore - Port of the PdfSharp library to .NET Core - largely removed GDI+ (only missing GetFontData - which can be replaced with freetype2)
btelnyy-ConfigLoader - a YAML like Config loader made in C#
pete - A library for building servers that utilize the OpenMetaverse Transport Protocol: a protocol for communicating world information in the metaverse.
wit-dotnet - .NET library for Wit.ai
RtMidi.Core - RtMidi for .Net Standard
RESTCountries.NET - ⚡Offline dotnet standard library to get information about countries
SimultaneousConsoleIO - C# command line tool for reading console input and writing console output at the same time.
NWNLogRotator - Automatically stores, cleans, and colorizes your Neverwinter Nights chat logs.