lemmatization-lists
Awesome-pytorch-list
lemmatization-lists | Awesome-pytorch-list | |
---|---|---|
3 | 2 | |
303 | 14,985 | |
- | - | |
0.0 | 0.0 | |
over 2 years ago | 4 months ago | |
ODC Open Database License v1.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lemmatization-lists
-
Ambiguous spellings
It's a bit of a massive undertaking maintaining such a data set so it's mostly taken from https://github.com/michmech/lemmatization-lists At the top of the file you'll see some additional I've added to deal with personal pronouns and numbers.
-
Is there a text list of words and their variations?
Another one to add to your list: https://github.com/michmech/lemmatization-lists
-
Trying to build a lemmatizer from scratch
One approach might be to take a lemmatization list, like the lemma-token lists at https://github.com/michmech/lemmatization-lists/, and compile it into a Finite State Transducer. The Helsinki FST package, for instance, has an hfst-strings2fst command to compile pairs of strings into a transducer. You might need to do some reformatting of the input first.
Awesome-pytorch-list
-
Similar open source long library list to TF like Pytorch "ECOSYSTEM TOOLS"
I got the following as recombination from elsewhere - https://github.com/jtoy/awesome-tensorflow and there is one for pt as well https://github.com/bharathgs/Awesome-pytorch-list . Thx for the help :D
-
[D] Similar open source long list to TF like Pytorch "ECOSYSTEM TOOLS"
https://github.com/jtoy/awesome-tensorflow https://github.com/bharathgs/Awesome-pytorch-list
What are some alternatives?
trankit - Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
awesome-asyncio - A curated list of awesome Python asyncio frameworks, libraries, software and resources
tldr-transformers - The "tl;dr" on a few notable transformer papers (pre-2022).
awesome-codex - A list dedicated to products, demos and articles related to 🤖 OpenAI's Codex.
awesome-sentiment-analysis - Repository with all what is necessary for sentiment analysis and related areas
500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code - 500 AI Machine learning Deep learning Computer vision NLP Projects with code
thesaurus - Offline database of synonyms/thesaurus
awesome-document-understanding - A curated list of resources for Document Understanding (DU) topic
3D-Machine-Learning - A resource repository for 3D machine learning
awesome-deep-learning - A curated list of awesome Deep Learning tutorials, projects and communities.
build-your-own-x - Master programming by recreating your favorite technologies from scratch.
d2l-en - Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.