-
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
trlX library here: https://github.com/CarperAI/trlx
Found relevant code at https://github.com/openai/summarize-from-feedback + all code implementations here
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
Recapping the AI, Machine Learning and Data Science Meetup — May 2, 2024
-
Show HN: An end-to-end reinforcement learning library for infinite horizon tasks
-
Problem with Truncated Quantile Critics (TQC) and n-step learning algorithm.
-
[P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials)
-
SB3 - NotImplementedError: Box([-1. -1. -8.], [1. 1. 8.], (3,), <class 'numpy.float32'>) observation space is not supported