mbfc_crawler
tanakai
mbfc_crawler | tanakai | |
---|---|---|
1 | 3 | |
16 | 263 | |
- | - | |
0.5 | 6.1 | |
almost 4 years ago | 5 months ago | |
Ruby | Ruby | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mbfc_crawler
-
Media Bias/Fact Check datasets or APIs?
Haven't tried it but I found a crawler that should create a json for the data at https://github.com/JeffreyATW/mbfc_crawler. I may look into other data sources in the coming weeks, will update if I see anything decent.
tanakai
- Tanakai: Modern web scraping framework written in Ruby
-
Tanakai 1.6.0 (web scraping gem) has been released with support to Ruby 3+
Tanakai intends to be a maintained fork of Kimurai, a modern web scraping framework written in Ruby which works out of box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests and allows to scrape and interact with JavaScript rendered websites.
-
Long life to Tanakai, a fork of Kimurai (a modern web scraping framework written in Ruby)
Long life to Tanakai, it has already got support to Chrome CDP through Apparition and Cuprite.
What are some alternatives?
manga2pdf - Simple Ruby script to download manga and merge the images into a single pdf file. Available with both CLI and GUI.
Wombat - Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
spidr - A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
cuprite - Headless Chrome/Chromium driver for Capybara
kimuraframework - Kimurai is a modern web scraping framework written in Ruby which works out of box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests and allows to scrape and interact with JavaScript rendered websites
apparition - Capybara driver for Chrome using CDP
instabot.rb - An instagram bot works without instagram api, only needs your username and password. written in ruby
Kimurai
vessel - Fast high-level web crawling Ruby framework
Mechanize - Mechanize is a ruby library that makes automated web interaction easy.
google-search-results-ruby - Google Search Results via SERP API Ruby Gem
Upton - A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)