Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more โ
Top 23 Python Wikipedium Projects
-
wikiteam
Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2023, WikiTeam has preserved more than 350,000 wikis.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
pywikibot
A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See https://www.mediawiki.org/wiki/Developer_account for contributing.
-
WordDumb
A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
codex
CoDEx: A set of knowledge graph Completion Datasets Extracted from Wikidata and Wikipedia (by tsafavi)
-
Mediawiker
A plugin for Sublime Text editor that adds possibility to use it as Wiki Editor on MediaWiki-based sites like Wikipedia and many other.
-
japanese-words-to-vectors
Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.
-
Wikipedia-Article-Scraper
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
-
wiki_dump
A library that assists in traversing and downloading from Wikimedia Data Dumps and their mirrors.
-
NLP-Model-for-Corpus-Similarity
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
There's also https://github.com/earwig/mwparserfromhell, if you don't want to roll your own.
WikiTeam is working on the archival, with the usual XML dumps and image dumps. You can follow updates and see how to help:
https://github.com/WikiTeam/wikiteam/issues/465#issuecomment...
https://wiki.archiveteam.org/index.php/Miraheze
Already before the announcement we had XML dumps for thousands of Miraheze wikis.
Manual here: https://xxyzz.github.io/WordDumb/
Here is the github : https://github.com/pj8912/wiki-blog-automation clone it and follow the instructions to automate the process of creating your own movie plots website and have fun! ๐
Project mention: 68k.news: Basic HTML Google News for Vintage Computers | news.ycombinator.com | 2023-06-16I share the frustration with the major online news portals, and have in fact built my own portal powered by Wikipedia[1].
But eventually I realized that my biggest gripe with news today isn't the presentation but the content. And I'm not talking about biases or sensationalism โ I'm talking about the news items themselves.
Much of what passes as news today is stuff like "15 people die when a copper mine collapses in Chile". I'm trying to get a big picture view of the world, and I don't believe that such stories are at all conducive to that endeavor. News as we know it is just an endless stream of random events, apparently selected according to a handful of crude criteria, the most important one being dead people. I've been a keen follower of global news for many years, and I don't feel that I'm understanding anything.
Where are the truly novel approaches to painting a picture of what the world is today? Where are the quantitative news portals, the event pattern search engines, the automatically derived trends? I'm still looking.
[1] https://pastevents.org
Python Wikipedia related posts
- Miraheze to Shut Down
- 68k.news: Basic HTML Google News for Vintage Computers
- Processing Wikipedia Dumps With Python
- Experimental library for scraping websites using OpenAI's GPT API
- Show HN: Terminal Based Wikipedia
- Show HN: Terminal Based Wikipedia
- Show HN: Terminal Based Wikipedia
-
A note from our sponsor - InfluxDB
www.influxdata.com | 25 Apr 2024
Index
What are some of the best open-source Wikipedium projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | mwparserfromhell | 698 |
2 | wikiteam | 686 |
3 | pywikibot | 612 |
4 | wik | 607 |
5 | Wikipedia-API | 532 |
6 | wikipedia_ql | 357 |
7 | WordDumb | 332 |
8 | mwclient | 305 |
9 | isbntools | 202 |
10 | codex | 136 |
11 | Mediawiker | 134 |
12 | japanese-words-to-vectors | 83 |
13 | danker | 53 |
14 | wistalk | 24 |
15 | Wikipedia-Article-Scraper | 17 |
16 | wikifunctions | 13 |
17 | wiki_dump | 9 |
18 | NLP-Model-for-Corpus-Similarity | 9 |
19 | witokit | 9 |
20 | taxopedia | 7 |
21 | movie-blog-automation | 6 |
22 | MediaWiki-Tools | 4 |
23 | pastevents | 3 |
Sponsored