Wtf_wikipedia Alternatives
Similar projects and alternatives to wtf_wikipedia
-
-
duckling
Language, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings.
-
SurveyJS
Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
-
anon
tweet about anonymous Wikipedia edits from particular IP address ranges
-
autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
-
mlscraper
🤖 Scrape data from HTML websites automatically by just providing examples
-
scrapeghost
👻 Experimental library for scraping websites using OpenAI's GPT API.
-
wikipedia_ql
Query language for efficient data extraction from Wikipedia
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
wtf_wikipedia reviews and mentions
-
Experimental library for scraping websites using OpenAI's GPT API
This may finally be a solution for scraping wikipedia and turning it into structured data. (Or do we even need structured data in the post-AI age?)
Mediawiki is notorious for being hard to parse:
* https://github.com/spencermountain/wtf_wikipedia#ok-first- - why it's hard
* https://techblog.wikimedia.org/2022/04/26/what-it-takes-to-p... - an entire article about parsing page TITLES
* https://osr.cs.fau.de/wp-content/uploads/2017/09/wikitext-pa... - a paper published about a wikitext parser
Stats
spencermountain/wtf_wikipedia is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of wtf_wikipedia is JavaScript.