SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python Chinese Projects
-
funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具、国内电话号码正则匹配、清华大学XLORE:中英文跨语言百科知识图谱、清华大学人工智能技术系列报告、自然语言生成、NLU太难了系列、自动对联数据及机器人、用户名黑名单列表、罪名法务名词及分类模型、微信公众号语料、cs224n深度学习自然语言处理课程、中文手写汉字识别、中文自然语言处理 语料/数据集、变量命名神器、分词语料库+代码、任务型对话英文数据集、ASR 语音数据集 + 基于深度学习的中文
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Project mention: CosyVoice 2025 Complete Guide: The Ultimate Multi-lingual Text-to-Speech Solution | dev.to | 2025-12-15git clone --recursive https://github.com/FunAudioLLM/CosyVoice.git cd CosyVoice # If submodule cloning fails due to network issues git submodule update --init --recursive
-
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
-
-
-
-
-
-
-
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
-
-
Huatuo-Llama-Med-Chinese
Repo for BenCao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调
-
-
-
-
-
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
-
-
-
Cornucopia-LLaMA-Fin-Chinese
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
-
-
Project mention: GameSentenceMiner – An All-in-One toolkit for learning Languages through games | news.ycombinator.com | 2025-11-24
Python Chinese discussion
Python Chinese related posts
-
Generating audiobooks from E-books with Kokoro-82M
-
What the heck is so great about this model?
-
New open-source LLM model Qwen 72B surpasses GPT4 in 4 of 10 benchmarks
-
Qwen (通义千问) chat and pretrained large language model by Alibaba Cloud
-
Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:263.0
-
Baichuan IA de China
-
Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:221.0
-
A note from our sponsor - SaaSHub
www.saashub.com | 6 Jun 2026
Index
What are some of the best open-source Chinese projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | funNLP | 80,502 |
| 2 | ChatTTS | 39,392 |
| 3 | CosyVoice | 21,440 |
| 4 | Qwen | 21,244 |
| 5 | ebook2audiobook | 19,168 |
| 6 | chinese-xinhua | 11,504 |
| 7 | GPT2-Chinese | 7,605 |
| 8 | InternLM | 7,216 |
| 9 | pkuseg-python | 6,702 |
| 10 | Baichuan-7B | 5,668 |
| 11 | awesome-pretrained-chinese-nlp-models | 5,568 |
| 12 | 汉字拼音转换工具(Python 版) | 5,318 |
| 13 | Huatuo-Llama-Med-Chinese | 4,970 |
| 14 | DeepKE | 4,415 |
| 15 | text-classification-cnn-rnn | 4,296 |
| 16 | Baichuan2 | 4,104 |
| 17 | Baichuan-13B | 2,932 |
| 18 | OFA | 2,557 |
| 19 | TencentPretrain | 1,087 |
| 20 | xpinyin | 831 |
| 21 | Cornucopia-LLaMA-Fin-Chinese | 658 |
| 22 | rime-cantonese | 658 |
| 23 | GameSentenceMiner | 648 |