word_forms
spacy-experimental
word_forms | spacy-experimental | |
---|---|---|
1 | 5 | |
617 | 95 | |
- | - | |
0.0 | 3.8 | |
over 3 years ago | 8 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
word_forms
-
Is there any alternative Python's word-forms?
However, I've had a peek into the word-forms source code and it seems very easy to translate to Java, it's basically parsing a big amount of data from text files. In my opinion, creating a Java version would be a nice open-source project, if there's anyone willing to do it.
spacy-experimental
-
Newbie question with Spacy Coreference Resolution
Trying to work with the newly released coreference resolution pipeline
-
spaCy just got an experimental feature to detect co-references
I think the details are mentioned here: https://github.com/explosion/spacy-experimental/releases/tag/v0.6.0
- SpanFinder is a new experimental spaCy component that identifies span boundaries
-
Cython Is 20
I can't speak for the parent commenter, but there is ofte. code 'around' the machine learning code that benefits from high-performance implementations. To give two examples:
1. We recently implemented an edit tree lemmatizer for spaCy. The machine learning model predicts labels that map to edit trees. However, in order to lemmatize tokens, the trees need to be applied. I implemented all the tree wrangling in Cython to speed up processing and save memory (trees can be encoded as compact C unions):
https://github.com/explosion/spaCy/blob/master/spacy/pipelin...
2. I am working on a biaffine parser for spaCy. Most implementations of biaffine parsing use a Python implementation of MST decoding, which is unfortunately quite slow. Some people have reported it to dominate parsing time (rather than a rather expensive transformer + biaffine layer). I have implemented MST decoding in Cython and it barely shows up in profiles:
https://github.com/explosion/spacy-experimental/blob/master/...
-
Utilizando Neural edit-tree lemmatization para o português
Nós iremos utilizar o template do edit_tree_lemmatizer contido da pasta de projetos do repositório https://github.com/explosion/spacy-experimental e modificaremos para treinar um modelo em português em vez de alemão.
What are some alternatives?
grungegirl - grungegirl is the hacker's drug encyclopedia. programmed in python for maximum modularity and ease of configuration.
neuralcoref - ✨Fast Coreference Resolution in spaCy with Neural Networks
dictionary - A list of the most popular English words.
warp - A Python framework for high performance GPU simulation and graphics
wordhoard - This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.
epython - EPython is a typed-subset of the Python for extending the language new builtin types and methods
tmatch - Super fast token matcher
nanobind - nanobind: tiny and efficient C++/Python bindings
simplenlg - Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version.
sentence-splitter - Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
textaugment - TextAugment: Text Augmentation Library
projects - 🪐 End-to-end NLP workflows from prototype to production