wikitextparser
A Python library to parse MediaWiki WikiText (by 5j9)
DrQA
Reading Wikipedia to Answer Open-Domain Questions (by facebookresearch)
wikitextparser | DrQA | |
---|---|---|
1 | 1 | |
268 | 4,467 | |
- | 0.3% | |
9.0 | 0.0 | |
17 days ago | 7 months ago | |
Python | Python | |
GNU General Public License v3.0 only | GNU General Public License v3.0 or later |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wikitextparser
Posts with mentions or reviews of wikitextparser.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-04-06.
-
Updated: I've saved all of Wikipedia into a SQLITE database!
The use of regex seems inefficient, is there any reason why you didn't start with lxml or a purpose built parser like wikitextparser?
DrQA
Posts with mentions or reviews of DrQA.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-04-06.
-
Updated: I've saved all of Wikipedia into a SQLITE database!
Nice work! That AI "framework" (to summarize the RAVEN acronym somehow) of yours reminds me of an old project of myself years ago, using prolog and first order logic to build a QA engine and pulling data from wikipedia. Something I eventually abandoned due to changing philosophical views on human consciousness... - yet it was still a fun learning exercise mixing compiler theory and logical inference. Facebook once open sourced code for something similar https://github.com/facebookresearch/DrQA - also pulling raw data from wikipedia.
What are some alternatives?
When comparing wikitextparser and DrQA you can also consider the following projects:
mwparserfromhell - A Python parser for MediaWiki wikicode
Mediawiker - A plugin for Sublime Text editor that adds possibility to use it as Wiki Editor on MediaWiki-based sites like Wikipedia and many other.
PlainTextWikipedia - Convert Wikipedia database dumps into plaintext files
MediaWiki-Tools - Tools for getting data from MediaWiki websites
obsei - Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .