Chinese-LLaMA-Alpaca
LLM-Agent-Paper-List
Chinese-LLaMA-Alpaca | LLM-Agent-Paper-List | |
---|---|---|
4 | 4 | |
17,539 | 5,412 | |
- | - | |
8.3 | 8.5 | |
22 days ago | 20 days ago | |
Python | ||
Apache License 2.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Chinese-LLaMA-Alpaca
-
Chinese-Alpaca-Plus-13B-GPTQ
I'd like to share with you today the Chinese-Alpaca-Plus-13B-GPTQ model, which is the GPTQ format quantised 4bit models of Yiming Cui's Chinese-LLaMA-Alpaca 13B for GPU reference.
-
How to train a new language that is not in base model?
Could follow what people did with the Chinese-LLaMA, just for Korean. Might want to have a pure Korean corpus before feeding in a translation dataset. How big is it by the way?
- Open Source Chinese LLMs
-
Its possible to fine tune the llama model to better understand another language?
Chinese: https://github.com/ymcui/Chinese-LLaMA-Alpaca
LLM-Agent-Paper-List
What are some alternatives?
ChatGLM2-6B - ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
awesome-llm - Awesome series for Large Language Model(LLM)s
CodeCapybara - Open-source Self-Instruction Tuning Code LLM
ml-surveys - 📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
LLMSurvey - The official GitHub page for the survey paper "A Survey of Large Language Models".
what-would-mother-say - 💁♀️ A Tweet creation Agent that fetches usernames last k tweets and generates a tweet about the requested topic
paxml - Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
Awesome-Text2SQL - Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
Qwen-VL - The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
prompt-engineering - Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
Cornucopia-LLaMA-Fin-Chinese - 聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
agent-os - Build autonomous AI agents! 🌞