stable-baselines
open-ai
Metric | stable-baselines | open-ai
---|---|---
Mentions | 10 | 18
Stars | 4,000 | 2,139
Growth | - | -
Activity | 0.0 | 7.7
Last commit | over 1 year ago | about 1 month ago
Language | Python | PHP
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
stable-baselines
- Distributed implementation tips
  As gold-panda pointed out, you can try multiprocessing. I once implemented a version based on what stable_baselines v1 does (https://github.com/hill-a/stable-baselines/blob/master/stable_baselines/common/vec_env/subproc_vec_env.py).
- GAIL without actions?
  Found relevant code at https://github.com/hill-a/stable-baselines + all code implementations here
- Best framework to use if learning today
  Depends on what you wanna do. A universal answer would be https://stable-baselines.readthedocs.io/
- weird mean reward graph
  As you will see here, it is recommended to augment this safety measure with a target KL divergence, which ensures smoother learning and enforces early stopping to prevent learning collapse.
- Nvidia ISAAC gym/RL
  Code for https://arxiv.org/abs/1707.06347 found: https://github.com/hill-a/stable-baselines
- Bounds for observation
- Understanding multi agent learning in OpenAI gym and stable-baselines
  I haven't read the code, but stable-baselines doesn't support multi-agent environments (https://github.com/hill-a/stable-baselines/issues/423), so I think they're trying to make multi-agent learning easier with Environment.train().
- Using Reinforcement Learning to beat the first boss in Dark Souls 3 with Proximal Policy Optimization
- Reinforcement Learning Crash Course (Free)
  - https://github.com/hill-a/stable-baselines (TensorFlow)
- JAX Implementations of Actor-Critic Algorithms
  - tf2 speed: https://github.com/hill-a/stable-baselines/issues/576#issuecomment-573331715
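The "weird mean reward graph" mention above recommends adding a target KL divergence with early stopping on top of an existing safety measure (presumably PPO-style clipping, which is an assumption here). A minimal Python sketch of such a check follows. It assumes the common sample estimate of the KL (the mean of old minus new log-probabilities) and a 1.5x tolerance on the target. The function names, the tolerance value, and the toy log-probabilities are illustrative and are not taken from stable-baselines or the linked thread.

```python
def approx_kl(old_log_probs, new_log_probs):
    """Sample estimate of KL(old || new): mean of (log pi_old - log pi_new)."""
    diffs = [old - new for old, new in zip(old_log_probs, new_log_probs)]
    return sum(diffs) / len(diffs)


def count_safe_epochs(old_log_probs, log_probs_per_epoch, target_kl, tolerance=1.5):
    """Count candidate updates accepted before the KL check triggers early stopping.

    log_probs_per_epoch is a hypothetical stand-in for the log-probabilities the
    policy assigns to the sampled actions after each successive gradient epoch.
    """
    accepted = 0
    for new_log_probs in log_probs_per_epoch:
        # Stop as soon as the policy has drifted too far from the one
        # that collected the data.
        if approx_kl(old_log_probs, new_log_probs) > tolerance * target_kl:
            break
        accepted += 1
    return accepted


# Toy data: the third candidate update drifts well past the threshold.
old = [-1.0, -1.0, -1.0]
candidates = [
    [-1.0, -1.0, -1.0],    # no drift
    [-1.01, -1.02, -1.0],  # small drift, estimate about 0.01
    [-1.1, -1.1, -1.1],    # large drift, estimate about 0.1
]
print(count_safe_epochs(old, candidates, target_kl=0.02))
```

With `target_kl=0.02` the threshold is 0.03, so the first two candidate updates are accepted and the loop stops at the third.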
open-ai
- ChatGPT stream supported PHP-Laravel API
- ChatGPT Plus or ChatGPT API
- Added ChatGPT (chat-completions) API to OpenAI package
- ChatGPT and Whisper APIs
  PHP library
  https://github.com/orhanerday/open-ai#chat-as-known-as-chatg...
- Ask HN: Any tips for promoting an open source package on Hacker News?
  Hi everyone,
  I released an open source PHP package that leverages the power of OpenAI's language model to enhance natural language processing in PHP projects. The package has already gained some traction with over 60k downloads on GitHub, but I'm hoping to increase its exposure and reach a wider audience.
  I'm wondering if anyone has successfully promoted an open source package on Hacker News before and has any tips to share. What worked for you? What didn't work? How did you craft your title and messaging to catch the attention of the Hacker News community?
  If you're interested in checking out the package, it's available on GitHub here: https://github.com/orhanerday/open-ai
  Any feedback or advice would be greatly appreciated! Thanks in advance for your help.
- I have 60k downloads but got only 900 GitHub stars for my package
- Show HN: Open-source ChatBot supports server-sent event
- OpenAI PHP!
- OpenAI PHP
What are some alternatives?
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
gpt-neox - An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Ray - Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
discord-openai-bot - A Discord chatbot that uses OpenAI's API to generate conversation.
rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
ControlNet - Let us control diffusion models!
Super-mario-bros-PPO-pytorch - Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
gpt-3-simple-tutorial - Generate SQL from Natural Language Sentences using OpenAI's GPT-3 Model
Tic-Tac-Toe-Gym - This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning
client - ⚡️ OpenAI PHP is a supercharged community-maintained PHP API client that allows you to interact with OpenAI API.
DI-engine - OpenDILab Decision AI Engine
openai-cookbook - Examples and guides for using the OpenAI API