spidr
MetaInspector
spidr | MetaInspector | |
---|---|---|
- | 1 | |
792 | 1,021 | |
- | -0.4% | |
6.4 | 6.4 | |
3 months ago | about 2 months ago | |
Ruby | Ruby | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spidr
We haven't tracked posts mentioning spidr yet.
Tracking mentions began in Dec 2020.
MetaInspector
-
Show HN: A rich daily digest feed for Hacker News
I made a feed that contains rich information from every article (title, description, image), and also from the Hacker News discussion.
The feed is very convenient for reviewing Hacker News content weekly and picking what to read out of the myriad of articles.
The backend is Ruby on AWS Lambda, with the brilliant [metainspector gem](https://github.com/metainspector/metainspector) to extract rich content.
What are some alternatives?
Mechanize - Mechanize is a ruby library that makes automated web interaction easy.
spidy Web Crawler - The simple, easy to use command line web crawler.
Wombat - Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
instabot.rb - An instagram bot works without instagram api, only needs your username and password. written in ruby
LinkThumbnailer - Ruby gem that fetches images and metadata from a given URL. Much like popular social website with link preview.
crawlee - Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Upton - A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)
anemone - Anemone web-spider framework
Kimurai
pismo - Extracts machine-readable metadata and content from Web pages