bitextor vs Hebrew-Tokenizer
Metric | bitextor | Hebrew-Tokenizer
---|---|---
Mentions | 2 | 1
Stars | 279 | 25
Growth | 0.7% | -
Activity | 5.9 | 0.0
Latest commit | 8 months ago | over 2 years ago
Language | Python | Python
License | GNU General Public License v3.0 only | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Hebrew-Tokenizer
I need help with a natural language processing problem
I was told to use this tokenizer (https://github.com/YontiLevin/Hebrew-Tokenizer) and this for feature extraction (https://scikit-learn.org/stable/modules/feature_extraction.html):
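The workflow the post describes (tokenize Hebrew text, then extract features) can be sketched roughly as below. To stay dependency-free, this is only an illustration: `simple_hebrew_tokenize` is a hypothetical regex stand-in for Hebrew-Tokenizer, and `bag_of_words` is a hand-rolled stand-in for a scikit-learn count vectorizer. Neither reproduces the real libraries' APIs or behavior.

```python
import re
from collections import Counter

# Hypothetical stand-in for Hebrew-Tokenizer: matches runs of Hebrew letters
# (U+05D0..U+05EA). The real library distinguishes many more token types.
HEBREW_WORD = re.compile(r"[\u05D0-\u05EA]+")


def simple_hebrew_tokenize(text):
    """Return the Hebrew words found in `text`, in order of appearance."""
    return HEBREW_WORD.findall(text)


def bag_of_words(docs):
    """Build a sorted vocabulary and one row of token counts per document."""
    tokenized = [simple_hebrew_tokenize(doc) for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    rows = []
    for toks in tokenized:
        counts = Counter(toks)
        rows.append([counts.get(tok, 0) for tok in vocab])
    return vocab, rows


docs = ["שלום עולם", "שלום שלום חברים"]
vocab, matrix = bag_of_words(docs)
for doc, row in zip(docs, matrix):
    print(row, doc)
```

With the real libraries, scikit-learn's `CountVectorizer` accepts a `tokenizer` callable, so a small wrapper that returns a list of token strings from Hebrew-Tokenizer's output could likely be passed there instead of the regex stand-in.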
What are some alternatives?
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
TheAlgorithms - All Algorithms implemented in Python
trankit - Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
python - Official Python client library for kubernetes
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
xontrib-output-search - Get identifiers, paths, URLs and words from the previous command output and use them for the next command in xonsh shell.
nematus - Open-Source Neural Machine Translation in Tensorflow
sentence-splitter - Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
rofimoji - Emoji, unicode and general character picker for rofi and rofi-likes
OpenNMT-py - Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Bible-Gematria-Interlinear-Explorer - View the gematria of the Bible. Explore Hebrew/Greek words and see their definitions. Explore all aspects of the Bible.