Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
wordfreq
Discontinued Access a database of word frequencies, in various natural languages. [Moved to: https://github.com/rspeer/wordfreq] (by LuminosoInsight)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Jieba: Chinese text segmenter
OpenCC: convert between traditional and simplified Chinese, see also http://opencc.byvoid.com/
zhtext: Chinese text segmenter, Reddit thread here
wordfreq: a database of word frequencies, in various natural languages
zhvocab: Chinese vocab database, tagged by category, Reddit thread here
Pinyin/Jyutping Generator, Reddit thread here
annotator.js: a JavaScript+CSS library that annotates text with pinyin, Reddit thread here
Dragonmapper: identification and conversion functions for Chinese text processing
g2pC: a context-aware grapheme-to-phoneme conversion module for Chinese, Reddit thread here