Lm-human-preferences Alternatives

Similar projects and alternatives to lm-human-preferences

dalle-mini

3,446 14,636 5.2 Python lm-human-preferences VS dalle-mini

DALL·E Mini - Generate images from a text prompt
stylegan2-pytorch

1,989 3,613 0.0 Python lm-human-preferences VS stylegan2-pytorch

Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
jukebox

129 7,563 0.0 Python lm-human-preferences VS jukebox

Code for the paper "Jukebox: A Generative Model for Music"
RWKV-LM

84 11,579 8.8 Python lm-human-preferences VS RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
nanoGPT

69 31,713 5.4 Python lm-human-preferences VS nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.
gpt-2

63 21,111 2.5 Python lm-human-preferences VS gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"
dalle-2-preview

61 1,049 1.8 lm-human-preferences VS dalle-2-preview
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
DALLE-mtf

41 435 0.0 Python lm-human-preferences VS DALLE-mtf

Open-AI's DALL-E for large scale training in mesh-tensorflow.
gpt-3

39 9,406 3.5 lm-human-preferences VS gpt-3

Discontinued GPT-3: Language Models are Few-Shot Learners
glide-text2im

32 3,467 0.0 Python lm-human-preferences VS glide-text2im

GLIDE: a diffusion-based text-conditional image synthesis model
ChatRWKV

28 9,276 8.3 Python lm-human-preferences VS ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
GLM-130B

19 7,607 4.8 Python lm-human-preferences VS GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
trl

13 8,023 9.6 Python lm-human-preferences VS trl

Train transformer language models with reinforcement learning.
sentencepiece

19 9,451 8.3 C++ lm-human-preferences VS sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.
v-diffusion-pytorch

10 690 0.0 Python lm-human-preferences VS v-diffusion-pytorch

v objective diffusion inference code for PyTorch.
community-events

8 375 7.2 Jupyter Notebook lm-human-preferences VS community-events

Place where folks can contribute to 🤗 community events
tensorrtx

3 6,556 8.0 C++ lm-human-preferences VS tensorrtx

Implementation of popular deep learning networks with TensorRT network definition API
bevy_retro

5 294 4.0 Rust lm-human-preferences VS bevy_retro

Plugin pack for making 2D games with Bevy
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better lm-human-preferences alternative or higher similarity.

Suggest an alternative to lm-human-preferences

lm-human-preferences reviews and mentions

Posts with mentions or reviews of lm-human-preferences. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-14.

Ask HN: Open-source GPT-3 alternatives
4 projects | news.ycombinator.com | 14 Feb 2023
El éxito continuo de OpenAI: Y como llegaron a crear la IA más avanzada del 2023. ChatGPT.
2 projects | dev.to | 2 Feb 2023
Sam Altman on the best and worst case scenario for AI - "...the good case is just so unbelievably good that you sound like a really crazy person to start talking about it."
1 project | /r/singularity | 21 Jan 2023

Lest you think that that sounds like a too galaxy-brained possibility, it has already happened at OpenAI (scroll down to "Bugs can optimize for bad behavior"), just with a model that was very far from being capable enough to be dangerous.
Value head in GPT2
2 projects | /r/reinforcementlearning | 4 Oct 2022

Found relevant code at https://github.com/openai/lm-human-preferences + all code implementations here
Should we stick to the devil we know?
2 projects | /r/TheMotte | 11 Aug 2022

That's why, when they're serious, they use RL for finetuning from human preferences (would be hilarious if this attempt to solve the terrible bias you take to be evidence of AGI threat ends up creating a Woke Singleton itself, btw); it's a powerful general approach, and I see no sign of it being applied here.
Dall-E 2
16 projects | news.ycombinator.com | 6 Apr 2022

The kind of measures they are taking, like simply deleting wholesale anything problematic, don't really have a '-1'.
But amusingly, exactly that did happen in one of their GPT experiments! https://openai.com/blog/fine-tuning-gpt-2/
Discussion Thread
1 project | /r/neoliberal | 5 Apr 2022
[D] Applications for using reinforcement learning to fine-tune GPT-2
2 projects | /r/MachineLearning | 19 Mar 2022

Code for https://arxiv.org/abs/1909.08593 found: https://github.com/openai/lm-human-preferences
A note from our sponsor - InfluxDB
www.influxdata.com | 24 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →