tanakai
mbfc_crawler
| | tanakai | mbfc_crawler |
|---|---|---|
| Mentions | 3 | 1 |
| Stars | 263 | 16 |
| Growth | - | - |
| Activity | 6.1 | 0.5 |
| Last commit | 5 months ago | almost 4 years ago |
| Language | Ruby | Ruby |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
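The page does not publish the formula behind the activity number, so the following is only a rough sketch of how a recency-weighted, percentile-based score of this kind could be computed. The exponential decay, the `HALF_LIFE_DAYS` constant, and both method names are assumptions made for illustration, not the site's actual method.

```ruby
# Hypothetical sketch: the real formula is not given in the source.
# Assumed: a commit's weight halves every HALF_LIFE_DAYS days.
HALF_LIFE_DAYS = 30.0

# Raw score: sum of per-commit weights, so recent commits count more than old ones.
def recency_weighted_commits(commit_ages_in_days)
  commit_ages_in_days.sum { |age| 0.5**(age / HALF_LIFE_DAYS) }
end

# Relative score on a 0..10 scale: the share of tracked projects whose
# raw score is strictly lower than this project's, times 10.
def activity_score(raw, all_raw_scores)
  below = all_raw_scores.count { |score| score < raw }
  (10.0 * below / all_raw_scores.size).round(1)
end
```

Under these assumptions, a project whose raw score beats 90 of 100 tracked projects gets `activity_score` 9.0, which matches the "top 10%" reading described above.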
tanakai
- Tanakai: Modern web scraping framework written in Ruby
- Tanakai 1.6.0 (web scraping gem) has been released with support to Ruby 3+
  Tanakai intends to be a maintained fork of Kimurai, a modern web scraping framework written in Ruby. It works out of the box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests, and lets you scrape and interact with JavaScript-rendered websites.
- Long life to Tanakai, a fork of Kimurai (a modern web scraping framework written in Ruby)
  Long live Tanakai: it already supports Chrome CDP through Apparition and Cuprite.
mbfc_crawler
- Media Bias/Fact Check datasets or APIs?
  I haven't tried it, but I found a crawler that should produce a JSON file of the data: https://github.com/JeffreyATW/mbfc_crawler. I may look into other data sources in the coming weeks and will update if I see anything decent.
What are some alternatives?
Wombat - Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
manga2pdf - Simple Ruby script to download manga and merge the images into a single pdf file. Available with both CLI and GUI.
cuprite - Headless Chrome/Chromium driver for Capybara
spidr - A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
kimuraframework - Kimurai is a modern web scraping framework written in Ruby. It works out of the box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests, and lets you scrape and interact with JavaScript-rendered websites.
apparition - Capybara driver for Chrome using CDP
instabot.rb - An Instagram bot, written in Ruby, that works without the Instagram API and needs only your username and password.
vessel - Fast high-level web crawling Ruby framework
Mechanize - Mechanize is a Ruby library that makes automated web interaction easy.
google-search-results-ruby - Google Search Results via SERP API Ruby Gem
Upton - A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)