spider
nipper
spider | nipper | |
---|---|---|
1 | 1 | |
638 | 121 | |
7.2% | - | |
9.5 | 0.0 | |
9 days ago | about 2 months ago | |
Rust | Rust | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spider
nipper
-
Yet another 'read it later' web app
Kind of. In the early stages of development, I was using mozilla/readability in Node.js, but I wanted to implement as many parts of it as possible in Rust, so I searched for such a crate. However, none of the crates worked as well as mozilla's crate, so I created my own readability-like module, based on an example I found at nipper crate. I added and modified various things to make it conform to mozilla's implementation.
What are some alternatives?
colly - Elegant Scraper and Crawler Framework for Golang
website-stalker - Track changes on websites via git
crusty-core - A small library for building fast and highly customizable web crawlers
leaf - Self-hostable read-it-later web app.
Crawler4j - Open Source Web Crawler for Java
readability.rs - Really fast readability
scraping-with-rust - 👾 scraping hacker news with rust
rust-bitcoin-indexer - Powerful & versatile Bitcoin Indexer, in Rust
chan-downloader - CLI to download all images/webms in a 4chan thread
crab - Python-based Scraping and parsing toolkit
hltv-rust - A client to fetch and parse data from HLTV.org (origin at https://foss.alic.dev/dist1ll/hltv-rust)
shrike - Data analysis infrastructure for the Neo N3 blockchain.