The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Wiktextract Alternatives
Similar projects and alternatives to wiktextract
-
logseq
A local-first, non-linear, outliner notebook for organizing and sharing your personal knowledge base. Use it to organize your todo list, to write your journals, or to record your unique life.
-
organice
An implementation of Org mode without the dependency of Emacs - built for mobile and desktop browsers
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Maat
Validation and transformation library powered by deductive ascending parser. Made to be extended for any kind of project.
-
trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
-
Kotoba
Quickly search the built-in iOS dictionary to see definitions of words. Collect words you want to remember.
-
wordnote
A simple and elegant notebook to write new words and discover their meanings and synonyms https://wordnote.app
-
wikdict-gen
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
wiktextract reviews and mentions
- Wiktionary dump file parser and multilingual data extractor
- How to Download All of Wikipedia onto a USB Flash Drive
-
I built a dictionary app even with more than and300 apps available at AppStore
Great work
I'm working on similar dictionary app and found wiktionary insanely usable as dictionary source.
Here is one more project aiming to make wiktionary data usable as json data structure: https://github.com/tatuylonen/wiktextract.
It has a link to a site https://kaikki.org/ which hosts dictionary data dumps.
-
Dynamically generating minimal pair decks for Anki
Hm, that would be a good idea... if I didn't have to download so much data (over 20GB for just audio?!). But, looking at the Python library that processed those dumps (https://github.com/tatuylonen/wiktextract), which is more manageable, using it would involve getting the WikiMedia dump file for every word on the list, then parsing them for the relevant data and what follows is mostly the same, except I end up with a bunch of cached files.
-
What are some of the best digital free dictionaries available online (even for commercial use)?
Many parsers are available. https://github.com/tatuylonen/wiktextract
-
Best Approach to importing a languages dictionary
I'd probably try pulling from Wiktionary, there looks to be a Python package that can do it here.
-
This is not perfect but it's a start
And the json is built with https://github.com/tatuylonen/wiktextract whom I have to thank
-
A note from our sponsor - WorkOS
workos.com | 26 Apr 2024
Stats
tatuylonen/wiktextract is an open source project licensed under GNU General Public License v3.0 or later which is an OSI approved license.
The primary programming language of wiktextract is Python.
Sponsored