witokit
trankit
Our great sponsors
witokit | trankit | |
---|---|---|
1 | 1 | |
9 | 705 | |
- | - | |
2.6 | 6.5 | |
over 3 years ago | 9 days ago | |
Python | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
witokit
trankit
-
Trankit v1.0.0 - An open-source Transformer-based Multilingual NLP Toolkit for 56 languages is out.
Trankit is written in Python and can be easily installed via pip. Our code and pretrained models are publicly available at: https://github.com/nlp-uoregon/trankit
What are some alternatives?
wit - WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
wiki_dump - A library that assists in traversing and downloading from Wikimedia Data Dumps and their mirrors.
Stanza - Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
wikiteam - Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2023, WikiTeam has preserved more than 350,000 wikis.
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
wp2git - Downloads and imports Wikipedia page histories to a git repository
argilla - Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
wiktextract - Wiktionary dump file parser and multilingual data extractor
flair - A very simple framework for state-of-the-art Natural Language Processing (NLP)
Sentimentanalysis - Language independent sentiment analysis
quantulum3 - Library for unit extraction - fork of quantulum for python3