| | safe-rlhf | alignment-handbook |
|---|---|---|
| Mentions | 1 | 3 |
| Stars | 1,160 | 3,844 |
| Growth | 4.5% | 6.9% |
| Activity | 8.1 | 8.6 |
| Latest commit | 20 days ago | 9 days ago |
| Language | Python | Python |
| License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
safe-rlhf
alignment-handbook
- Recipes to align LLMs with AI feedback
What on-demand GPU service would you recommend for fine-tuning 7B models?
I'd like to run some fine-tuning experiments on 7B models. Specifically, I'm interested in using https://github.com/huggingface/alignment-handbook to run the Zephyr-7B recipes on custom datasets. I don't have a viable GPU locally.
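One way to size such a GPU rental is a back-of-envelope memory estimate. The sketch below is illustrative, not a measurement: the 16-bytes-per-parameter figure assumes bf16 weights with fp32 Adam states and fp32 master weights, it ignores activations and KV caches, and the 1% trainable-parameter fraction for LoRA is an assumed round number.

```python
# Rough GPU memory estimates for fine-tuning a 7B-parameter model.
# Assumptions: bf16 weights, Adam optimizer with fp32 moments and
# fp32 master weights; activation memory is deliberately excluded.

def full_finetune_gib(params_b: float) -> float:
    """Full fine-tuning: 2 B/param weights (bf16) + 2 B grads (bf16)
    + 8 B Adam moments (fp32 m and v) + 4 B fp32 master copy
    = ~16 bytes per parameter."""
    return params_b * 1e9 * 16 / 2**30

def lora_gib(params_b: float, trainable_frac: float = 0.01) -> float:
    """LoRA-style tuning: frozen bf16 base model (2 B/param) plus
    grads and optimizer states only for the small adapter subset."""
    base = params_b * 1e9 * 2
    adapters = params_b * 1e9 * trainable_frac * 16
    return (base + adapters) / 2**30

print(f"full fine-tune of a 7B model: ~{full_finetune_gib(7):.0f} GiB")
print(f"LoRA fine-tune of a 7B model: ~{lora_gib(7):.0f} GiB")
```

By this estimate, full fine-tuning of a 7B model needs multiple 80 GB cards, while a LoRA run can fit on a single 24 GB GPU, which is why most single-GPU rental setups use parameter-efficient methods.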
- Zephyr 7B β Released
What are some alternatives?
LLMSurvey - The official GitHub page for the survey paper "A Survey of Large Language Models".
opening-up-chatgpt.github.io - Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Tracking Openness, Transparency, and Accountability in Instruction-Tuned Text Generators.” In Proceedings of the 5th International Conference on Conversational User Interfaces. doi:10.1145/3571884.3604316.
CodeCapybara - Open-source Self-Instruction Tuning Code LLM
WebGLM - WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
AtomGPT - Chinese-English pretrained large language models, aiming to match ChatGPT's level of capability
argilla - Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
Cornucopia-LLaMA-Fin-Chinese - Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)
ray-llm - RayLLM - LLMs on Ray
h2o-wizardlm - Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning
peft - 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.