news-crawler vs web-scraping
| | news-crawler | web-scraping |
|---|---|---|
| Mentions | 1 | 43 |
| Stars | 101 | 678 |
| Growth | - | - |
| Activity | 0.0 | 0.0 |
| Last commit | over 1 year ago | over 2 years ago |
| Language | Python | Python |
| License | - | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
news-crawler
[Request] - Looking for a good historical news dataset. Maybe using Reuters.com?
The only relevant things I found were the remains of an old financial dataset from 2015 and a news crawler that has been unmaintained since Reuters changed their archive website and hard-limited the historical data you can access, starting 2020/03/08. This limits my goal of having a dataset that spans a wide range of time.
web-scraping
What are some alternatives?
financial-news-dataset - Reuters and Bloomberg
araknomecha-scrapper - A data scraper and import helper for the Corvo Astral Discord bot.
covid-19-us-api - A REST API for the US COVID-19 data (nytimes)
awesome-systematic-trading - A curated list of awesome libraries, packages, strategies, books, blogs, tutorials for systematic trading. [Moved to: https://github.com/paperswithbacktest/awesome-systematic-trading]
tagalog-dictionary-scraper - Builds a Tagalog dictionary by collecting Tagalog words from tagalog.pinoydictionary.com
ask-hn-urls - Finds URLs posted in comments in the Ask HN section of Hacker News.
Abosar - অবসর 📚 A collection of short Bengali stories web scraped from various Bengali eMagazines and eNewspapers.
Letterboxd-friend-ranker - Program that scores and ranks a given user and their friends based on Letterboxd ratings
facebook_page_scraper - Scrapes the front end of Facebook pages with no limitations and can turn the data into structured JSON or CSV