Sentence parser for Mandarin?

This page summarizes the projects mentioned and recommended in the original post on /r/ChineseLanguage

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • Jieba

    结巴中文分词

  • Jieba: Chinese text segmenter

  • OpenCC

    Conversion between Traditional and Simplified Chinese

  • OpenCC: convert between traditional and simplified Chinese, see also http://opencc.byvoid.com/

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • zhtext

    Tools for analyzing chinese texts

  • zhtext: Chinese text segmenter, Reddit thread here

  • wordfreq

    Discontinued Access a database of word frequencies, in various natural languages. [Moved to: https://github.com/rspeer/wordfreq] (by LuminosoInsight)

  • wordfreq: a database of word frequencies, in various natural languages

  • zhvocab

    Chinese vocab database, tagged by category

  • zhvocab: Chinese vocab database, tagged by category, Reddit thread here

  • python-pinyin-jyutping-sentence

    Convert a Chinese sentence to Pinyin or Jyutping

  • Pinyin/Jyutping Generator, Reddit thread here

  • annotator-js

  • annotator.js: a JavaScript+CSS library that annotates text with pinyin, Reddit thread here

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • dragonmapper

    Identification and conversion functions for Chinese text processing

  • Dragonmapper: identification and conversion functions for Chinese text processing

  • g2pC

    g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

  • g2pC: a context-aware grapheme-to-phoneme conversion module for Chinese, Reddit thread here

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts