storium-backend
Cornucopia-LLaMA-Fin-Chinese
| storium-backend | Cornucopia-LLaMA-Fin-Chinese
---|---|---
Mentions | 4 | 19
Stars | 8 | 536
Growth | - | -
Activity | 0.0 | 4.4
Latest commit | about 2 years ago | 10 months ago
Language | Python | Python
License | BSD 3-clause "New" or "Revised" License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
storium-backend
- [R] Wordcraft: a Human-AI Collaborative Editor for Story Writing
  I’m excited to see where research like this goes next, though I’m biased given my research on Storium.
- [D] Very long sequence data (books) understanding?
  I released a dataset of stories that average 19K tokens, with the longest exceeding a million. Our human evaluations show that relevance is the biggest factor in whether authors decide to use model-generated text in their story, making this a good platform for assessing long-document understanding and generation.
- [P] Question about generating stories
  More recent work tries to learn all of this purely from text. My dataset collected from Storium includes a narrator and annotations (e.g., challenges and goals) that can help learn these traits directly from the dataset.
- [D] Deploying ML models - batching
  If you’re willing to roll your own, you can see an example from my latest research project that makes use of asyncio.
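The last mention above points to rolling your own request batching with asyncio. The quoted project's code is not shown here, so what follows is only a minimal sketch of the general idea, not the storium-backend implementation. The names `MicroBatcher`, `batch_fn`, `max_batch_size`, and `max_wait_s` are hypothetical: each request is queued with a future, and a single worker groups queued requests into one model call.

```python
import asyncio


class MicroBatcher:
    """Hypothetical helper: groups single requests into batched model calls."""

    def __init__(self, batch_fn, max_batch_size=8, max_wait_s=0.01):
        self.batch_fn = batch_fn  # assumed signature: list of inputs -> list of outputs
        self.max_batch_size = max_batch_size
        self.max_wait_s = max_wait_s
        self.queue = asyncio.Queue()

    async def submit(self, item):
        """Enqueue one request and wait for its individual result."""
        future = asyncio.get_running_loop().create_future()
        await self.queue.put((item, future))
        return await future

    async def run(self):
        """Worker loop: collect up to max_batch_size items, then call batch_fn once."""
        loop = asyncio.get_running_loop()
        while True:
            # Block until at least one request arrives.
            batch = [await self.queue.get()]
            deadline = loop.time() + self.max_wait_s
            # Keep filling the batch until it is full or the wait budget is spent.
            while len(batch) < self.max_batch_size:
                timeout = deadline - loop.time()
                if timeout <= 0:
                    break
                try:
                    batch.append(await asyncio.wait_for(self.queue.get(), timeout))
                except asyncio.TimeoutError:
                    break
            inputs = [item for item, _ in batch]
            try:
                outputs = self.batch_fn(inputs)
            except Exception as exc:
                # Propagate a failed batch to every caller waiting on it.
                for _, future in batch:
                    if not future.done():
                        future.set_exception(exc)
                continue
            for (_, future), output in zip(batch, outputs):
                if not future.done():
                    future.set_result(output)


async def demo():
    calls = []  # records the size of each batch the fake model receives

    def fake_model(inputs):
        # Stand-in for a real model's batched forward pass.
        calls.append(len(inputs))
        return [x * 2 for x in inputs]

    batcher = MicroBatcher(fake_model, max_batch_size=4, max_wait_s=0.05)
    worker = asyncio.create_task(batcher.run())
    results = await asyncio.gather(*(batcher.submit(i) for i in range(6)))
    worker.cancel()
    return results, calls


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

In this sketch `batch_fn` runs synchronously inside the event loop. A real deployment would more likely hand a slow model call to an executor, for example with `loop.run_in_executor`, so that new requests can still be accepted while a batch is being processed.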
Cornucopia-LLaMA-Fin-Chinese
What are some alternatives?
server - The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Baichuan-7B - A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Spectrum - Spectrum is an AI that uses machine learning to generate rap song lyrics.
ray-llm - RayLLM - LLMs on Ray
commit-autosuggestions - A tool that uses AI to automatically recommend commit messages.
tableQA-Chinese - Unsupervised tableQA and databaseQA on Chinese finance questions and tabular data
GPT2-Chinese - Chinese version of GPT2 training code, using BERT tokenizer.
Chinese-LLaMA-Alpaca - Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
safe-rlhf - Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Huatuo-Llama-Med-Chinese - Repo for BenTsao [original name: HuaTuo (华驼)], instruction-tuning large language models with Chinese medical knowledge.
AtomGPT - A Chinese-English pretrained large language model that aims to match the level of ChatGPT
alignment-handbook - Robust recipes to align language models with human and AI preferences