HanLP VS Stanza

Compare HanLP and Stanza and see how they differ.

HanLP

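Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing (by hankcs)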

Stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages (by stanfordnlp)
              HanLP               Stanza
Mentions      3                   8
Stars         31,951              7,013
Growth        -                   1.0%
Activity      5.6                 9.7
Last commit   6 days ago          2 days ago
Language      Python              Python
License       Apache License 2.0  GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
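
As a rough illustration of the Growth figure, the sketch below computes a month-over-month percentage from two star counts. The exact formula this page uses is not stated, so the plain percentage change here is an assumption, and the star counts are hypothetical values chosen only to make the arithmetic easy to follow.

```python
def month_over_month_growth(stars_last_month: int, stars_now: int) -> float:
    """Percentage change in stars between two monthly snapshots.

    Assumption: growth is a plain percentage change; the page does not
    document its exact formula.
    """
    if stars_last_month <= 0:
        raise ValueError("stars_last_month must be positive")
    return (stars_now - stars_last_month) / stars_last_month * 100


# Hypothetical snapshots, not taken from either project.
growth = round(month_over_month_growth(1000, 1010), 1)
print(f"{growth}%")
```

Under this reading, a project that gains 10 stars on a base of 1,000 shows the same growth as one that gains 100 on a base of 10,000.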

HanLP

Posts with mentions or reviews of HanLP. We have used some of these posts to build our list of alternatives and similar projects.

We haven't tracked posts mentioning HanLP yet.
Tracking mentions began in Dec 2020.

Stanza

Posts with mentions or reviews of Stanza. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-06.
  • Parts of speech tagged for German
    3 projects | /r/German | 6 Jan 2023
    I use Python's spacy library: https://spacy.io/models/de or stanza: https://stanfordnlp.github.io/stanza/ each with their respective language models.
  • Off the shelf sentence parsers?
    2 projects | /r/LanguageTechnology | 26 Aug 2022
    stanza has a constituency parser. There's a model compatible with the dev branch with an accuracy of 95.8 on PTB, using Roberta as a bottom layer, so it's pretty decent at this point. (The currently released model is not as accurate, but it's easy to get the better model to you.) There's also Tregex as a Java addon which can very easily search for a noun phrase highest up in the tree: NP !>> NP will search for a noun phrase which is not dominated by any higher up noun phrase.
  • The Spacy NER model for Spanish is terrible
    2 projects | /r/LanguageTechnology | 20 Dec 2021
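
The Tregex pattern quoted above, NP !>> NP, selects every noun phrase that has no noun phrase anywhere above it in the tree. The sketch below reproduces that selection in plain Python on a bracketed parse string. It does not use Tregex or Stanza; the Node class, the helper names, and the sample sentence are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    label: str
    children: list = field(default_factory=list)


def parse_tree(text: str) -> Node:
    """Parse a Penn-Treebank-style bracketed string into Node objects."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read() -> Node:
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        if tok != "(":
            return Node(tok)  # a leaf: the word itself
        node = Node(tokens[pos])  # the constituent label
        pos += 1
        while tokens[pos] != ")":
            node.children.append(read())
        pos += 1  # consume the closing bracket
        return node

    return read()


def words(node: Node) -> list:
    """Collect the leaf words under a node, left to right."""
    if not node.children:
        return [node.label]
    return [w for child in node.children for w in words(child)]


def topmost(node: Node, label: str = "NP") -> list:
    """Return nodes with the given label that have no ancestor with that label.

    The search stops descending at the first match on each path, because
    anything below a matching node is dominated by it.
    """
    if node.children and node.label == label:
        return [node]
    found = []
    for child in node.children:
        found.extend(topmost(child, label))
    return found


tree = (
    "(ROOT (S (NP (NP (DT the) (NN dog)) (PP (IN in) (NP (DT the) (NN yard))))"
    " (VP (VBD chased) (NP (DT a) (NN cat)))))"
)
phrases = [" ".join(words(n)) for n in topmost(parse_tree(tree))]
print(phrases)  # ['the dog in the yard', 'a cat']
```

The nested phrases "the dog" and "the yard" are skipped because each sits under a higher noun phrase, which is the behavior the pattern describes.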

What are some alternatives?

When comparing HanLP and Stanza you can also consider the following projects:

spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python

NLTK - NLTK Source

BERT-NER - Pytorch-Named-Entity-Recognition-with-BERT

Jieba - Jieba Chinese text segmentation

flair - A very simple framework for state-of-the-art Natural Language Processing (NLP)

pytext - A natural language modeling framework based on PyTorch

polyglot - Multilingual text (NLP) processing toolkit

trankit - Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

stanfordnlp - [Deprecated] This library has been renamed to "Stanza". Latest development at: https://github.com/stanfordnlp/stanza

PyTorch-NLP - Basic Utilities for PyTorch Natural Language Processing (NLP)

TextBlob - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

textacy - NLP, before and after spaCy