hh-rlhf Alternatives
Similar projects and alternatives to hh-rlhf
- text-generation-webui: A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), and Llama models.
- awesome-RLHF: A curated list of reinforcement learning with human feedback resources (continually updated).
- chatllama: ChatLLaMA 📢 Open source implementation of a LLaMA-based ChatGPT, runnable on a single GPU, with a 15x faster training process than ChatGPT.
hh-rlhf reviews and mentions
- Meta wants its open source AI model to be as capable as OpenAI's best model
If you ask an LLM to complete a sentence like '[Insert name] stole the fruit (true/false):', an aligned LLM will be biased towards refusing to answer at all, with something like: "I can't tell you because I don't know them."
An "uncensored" LLM will very happily return "true" or "false" with a probability attached to each. Even OpenAI's GPT-3 does so with a low enough temperature.
Of course, LLM attention doesn't work like that. The tokens are just a bag of numbers:
- The fact the name 'John' is mentioned in the Bible a lot affects the distribution when you ask if any John stole, because John is always [7554]
- The fact that 'Olf' is part of Adolf and Adolf Hitler is mentioned in a lot of negative sentences will drag the distribution, because 'Olf' is always [4024] and Adolf is always [324, 4024]
You could have asked something with no logical probability difference at all, like:
- 'The store attendant's name was [name], did the child in Long Island drop his ball (true/false):'
And unless you train the model to give you disclaimers, it still follows the instruction faithfully and returns true/false with probabilities, demonstrating a deep regression in reasoning...
That's why for models past a certain size, alignment increases performance: https://arxiv.org/abs/2204.05862.
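To make the mechanics concrete, here is a minimal sketch of reading the probability a base language model assigns to "true" versus "false" as the next token. The comment names no specific model or library; GPT-2 via Hugging Face transformers is used here purely as an assumed stand-in.

```python
# Minimal sketch: probe the next-token probabilities a base (non-aligned)
# LM assigns to "true" / "false". GPT-2 is an assumed stand-in model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

prompt = ("The store attendant's name was John, "
          "did the child in Long Island drop his ball (true/false):")
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits[0, -1]   # logits for the next token only
probs = torch.softmax(logits, dim=-1)

# The leading space matters: GPT-2's BPE treats " true" and "true"
# as different tokens.
for word in (" true", " false"):
    token_id = tokenizer.encode(word)[0]
    print(f"P({word.strip()!r}) = {probs[token_id].item():.4f}")
```

A base model happily reports both probabilities for this logically unanswerable question, which is exactly the behavior the comment describes.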
- Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
- OpenDILab Awesome Paper Collection: RL with Human Feedback (3)
Title: Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
- Show HN: ChatLLaMA – A ChatGPT style chatbot for Facebook's LLaMA
It just hasn't been prompted or fine-tuned to have the neutral, self-effacing personality of ChatGPT.
It's doing the pure "try to guess the most likely next token" task on which they were both trained (https://heartbeat.comet.ml/causal-language-modeling-with-gpt...) (before the reinforcement learning from human feedback that makes them more tool-like https://arxiv.org/abs/2204.05862), with a bit of randomness added for variety's sake (https://huggingface.co/blog/how-to-generate).
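As a rough illustration of that "bit of randomness", here is a sketch of temperature sampling, the standard mechanism behind varied generations. The model choice and temperature value are assumptions for illustration, not something the comment specifies.

```python
# Sketch of temperature sampling: scale the logits, softmax, then draw a
# token at random instead of always taking the argmax. GPT-2 is an
# assumed stand-in model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The meaning of life is", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits[0, -1]

temperature = 0.8                                  # <1 sharpens, >1 flattens
probs = torch.softmax(logits / temperature, dim=-1)
next_id = torch.multinomial(probs, num_samples=1)  # stochastic pick
print(tokenizer.decode(next_id))                   # varies run to run
```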
- [D] Is Anthropic influential in research?
They have done good work, like releasing the paper and dataset for training an assistant with RLHF. https://github.com/anthropics/hh-rlhf
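For reference, that preference data can also be pulled from the Hugging Face Hub. The "Anthropic/hh-rlhf" mirror and its chosen/rejected fields are assumptions based on the public release, not stated in the comment.

```python
# Sketch: load the hh-rlhf preference pairs via the datasets library.
# The "Anthropic/hh-rlhf" Hub mirror name is assumed here.
from datasets import load_dataset

ds = load_dataset("Anthropic/hh-rlhf", split="train")

# Each record pairs a human-preferred ("chosen") completion with a
# rejected one, the format used to train an RLHF reward model.
example = ds[0]
print(example["chosen"][:300])
print(example["rejected"][:300])
```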
- [R] Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned - Anthropic - Ganguli et al 2022
GitHub: https://github.com/anthropics/hh-rlhf
Stats
anthropics/hh-rlhf is an open source project licensed under the MIT License, an OSI-approved license.