lm-human-preferences Alternatives
Similar projects and alternatives to lm-human-preferences
-
stylegan2-pytorch
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
-
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
lm-human-preferences reviews and mentions
- Ask HN: Open-source GPT-3 alternatives
- OpenAI's continued success: and how they came to create the most advanced AI of 2023, ChatGPT.
-
Sam Altman on the best and worst case scenario for AI - "...the good case is just so unbelievably good that you sound like a really crazy person to start talking about it."
Lest you think that that sounds like a too galaxy-brained possibility, it has already happened at OpenAI (scroll down to "Bugs can optimize for bad behavior"), just with a model that was very far from being capable enough to be dangerous.
-
Value head in GPT2
Found relevant code at https://github.com/openai/lm-human-preferences + all code implementations here
-
Should we stick to the devil we know?
That's why, when they're serious, they use RL for finetuning from human preferences (would be hilarious if this attempt to solve the terrible bias you take to be evidence of AGI threat ends up creating a Woke Singleton itself, btw); it's a powerful general approach, and I see no sign of it being applied here.
-
Dall-E 2
The kind of measures they are taking, like simply deleting wholesale anything problematic, don't really have a '-1'.
But amusingly, exactly that did happen in one of their GPT experiments! https://openai.com/blog/fine-tuning-gpt-2/
- Discussion Thread
-
[D] Applications for using reinforcement learning to fine-tune GPT-2
Code for https://arxiv.org/abs/1909.08593 found: https://github.com/openai/lm-human-preferences
Stats
openai/lm-human-preferences is an open-source project licensed under the MIT License, which is an OSI-approved license.
The primary programming language of lm-human-preferences is Python.
Popular Comparisons
- lm-human-preferences VS trl
- lm-human-preferences VS GLM-130B
- lm-human-preferences VS dalle-mini
- lm-human-preferences VS tensorrtx
- lm-human-preferences VS glide-text2im
- lm-human-preferences VS gpt-2
- lm-human-preferences VS dalle-2-preview
- lm-human-preferences VS sentencepiece
- lm-human-preferences VS community-events
- lm-human-preferences VS gpt-3