word_forms
textaugment
word_forms | textaugment | |
---|---|---|
1 | 2 | |
617 | 406 | |
- | 1.7% | |
0.0 | 4.6 | |
over 3 years ago | 10 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
word_forms
-
Is there any alternative Python's word-forms?
However, I've had a peek into the word-forms source code and it seems very easy to translate to Java, it's basically parsing a big amount of data from text files. In my opinion, creating a Java version would be a nice open-source project, if there's anyone willing to do it.
textaugment
-
NLP augmentation models
I just came across this Python library. It has a bunch of dictionary-, backtranslation- and knowledge-based heuristics that should work most of the time:
- Prefer volume or quality for BERT-based Text classification model
What are some alternatives?
grungegirl - grungegirl is the hacker's drug encyclopedia. programmed in python for maximum modularity and ease of configuration.
tfops-aug - TFOps-Aug: Implementation of policy-based image augmentation techniques based on TF2 Operations. All augmentations as efficient Tensorflow 2.11.0 operations. Easy integration into a tf.data API pipeline.
dictionary - A list of the most popular English words.
wordnet - Stand-alone WordNet API
spacy-experimental - đŸ§ª Cutting-edge experimental spaCy components and features
AugLy - A data augmentations library for audio, image, text, and video.
wordhoard - This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.
scattertext - Beautiful visualizations of how language differs among document types.
tmatch - Super fast token matcher
magnitude - A fast, efficient universal vector embedding utility package.
simplenlg - Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version.
embeddings_plot - A command line utility to create a plots of word embeddings