valhalla-nmt
RL4LMs
valhalla-nmt | RL4LMs | |
---|---|---|
1 | 5 | |
26 | 2,096 | |
- | 2.3% | |
0.9 | 0.0 | |
about 1 year ago | 3 months ago | |
Python | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
valhalla-nmt
-
Meet ‘VALHALLA’, a Machine Learning Method That can Hallucinate an Image of Written Words and Then Use It to Help Translate The Text into Another Language
Continue reading | Check out the paper, github, project and post
RL4LMs
-
How To Setup a Model With Guardrails?
I think of guardrails as another dimension of human preferences: whether you are training a model to answer questions more gooder or avoid saying horrifying stuff, you are teaching the model a preference. So I thinks it's a straightforward RLHF problem but from a different perspective.
-
OpenDILab Awesome Paper Collection: RL with Human Feedback (2)
Found relevant code at https://github.com/allenai/rl4lms + all code implementations here
-
Best option for creating a custom GPT AI
If you skip down ther s several open source options. https://github.com/allenai/RL4LMs is an example.
- Will we ever see an open source alternative to ChatGPT?
- An Open-Source Version of ChatGPT is Coming [News]
What are some alternatives?
pykale - Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!
trlx - A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Dromedary - Dromedary: towards helpful, ethical and reliable LLMs.
edenai-apis - Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
PaLM-rlhf-pytorch - Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Open-Assistant - OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
awesome-transformer-nlp - A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
NeMo-Guardrails - NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
ColossalAI - Making large AI models cheaper, faster and more accessible