PyTorch-NLP
DISCONTINUED
Jieba
Our great sponsors
PyTorch-NLP | Jieba | |
---|---|---|
1 | 6 | |
2,180 | 31,855 | |
- | - | |
0.0 | 0.0 | |
9 months ago | over 1 year ago | |
Python | Python | |
BSD 3-clause "New" or "Revised" License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PyTorch-NLP
-
Introduction to PyTorch
PyTorch-NLP
Jieba
-
[OC] How Many Chinese Characters You Need to Learn to Read Chinese!
jieba to do Chinese word segmentation
-
Sentence parser for Mandarin?
Jieba: Chinese text segmenter
-
I'm looking for a specific vocab list
https://github.com/fxsjy/jieba/ (has some good word frequency data)
What are some alternatives?
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
PaddlePaddle - PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
NLTK - NLTK Source
SnowNLP - Python library for processing Chinese text
pkuseg-python - pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
Stanza - Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
TextBlob - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
textacy - NLP, before and after spaCy
pytext - A natural language modeling framework based on PyTorch
polyglot - Multilingual text (NLP) processing toolkit