Wikipedia-API
pastevents
Wikipedia-API | pastevents | |
---|---|---|
1 | 4 | |
619 | 4 | |
2.9% | - | |
9.0 | 3.5 | |
8 days ago | 10 months ago | |
Python | Python | |
MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Wikipedia-API
-
Webscraping the intro of multiple wiki pages - where do I even start?
You might want to check the following python libraries: - wikipedia - wikipedia-api
pastevents
-
68k.news: Basic HTML Google News for Vintage Computers
I share the frustration with the major online news portals, and have in fact built my own portal powered by Wikipedia[1].
But eventually I realized that my biggest gripe with news today isn't the presentation but the content. And I'm not talking about biases or sensationalism – I'm talking about the news items themselves.
Much of what passes as news today is stuff like "15 people die when a copper mine collapses in Chile". I'm trying to get a big picture view of the world, and I don't believe that such stories are at all conducive to that endeavor. News as we know it is just an endless stream of random events, apparently selected according to a handful of crude criteria, the most important one being dead people. I've been a keen follower of global news for many years, and I don't feel that I'm understanding anything.
Where are the truly novel approaches to painting a picture of what the world is today? Where are the quantitative news portals, the event pattern search engines, the automatically derived trends? I'm still looking.
[1] https://pastevents.org
- Ask HN: Replacement for Google News?
-
Ask HN: Have you stopped reading most news?
I love Wikipedia's "Current Events" portal so much that I have built a search engine for it: https://pastevents.org/
PastEvents is my entry point to the news nowadays. It updates daily and I can look at source material and relevant articles linked directly from the event, as well as search the past for related events to understand chronology.
-
[OC] The most common terms in Wikipedia's "Current Events" between January 2003 and September 2022
Wikipedia text obtained via the PastEvents.org database (application source code available here).
What are some alternatives?
Wikipedia-Article-Scraper - A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
wik - wik is use to get information about anything on the shell using Wikipedia.
SerializeThis - Scripts to convert Html to JSON without APIs
pywikibot - A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See https://www.mediawiki.org/wiki/Developer_account for contributing.
taxopedia - Taxonomic trees (cladograms) from Wikipedia-scraped data.
wikipedia_ql - Query language for efficient data extraction from Wikipedia
NLP-Model-for-Corpus-Similarity - A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
word_cloud - A little word cloud generator in Python
MediaWiki-Tools - Tools for getting data from MediaWiki websites
mwparserfromhell - A Python parser for MediaWiki wikicode
wikiteam - Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2024, WikiTeam has preserved more than 600,000 wikis.