CS-7641-Machine-Learning-Notes
In this repository, I will publish my notes for GaTech's Machine Learning course CS7641. (by mohamedameen93)
stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms (by hill-a)
CS-7641-Machine-Learning-Notes | stable-baselines | |
---|---|---|
1 | 10 | |
197 | 4,000 | |
- | - | |
0.0 | 0.0 | |
over 3 years ago | over 1 year ago | |
Python | ||
- | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
CS-7641-Machine-Learning-Notes
Posts with mentions or reviews of CS-7641-Machine-Learning-Notes.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-01-12.
-
Notes for ML, RL, and CV?
I have notes for ML: https://github.com/mohamedameen93/CS-7641-Machine-Learning-Notes
stable-baselines
Posts with mentions or reviews of stable-baselines.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-08-28.
-
Distributed implementation tips
As underlined by gold-panda, you can give a try with multiprocessing. I once implemented a version based on what is done in stable_baselines v1 (https://github.com/hill-a/stable-baselines/blob/master/stable_baselines/common/vec_env/subproc_vec_env.py)
-
GAIL without actions?
Found relevant code at https://github.com/hill-a/stable-baselines + all code implementations here
-
Best framework to use if learning today
Depends what you wanna do. Universal answer would be https://stable-baselines.readthedocs.io/
-
weird mean reward graph
As you will see here it is recommended to augment this safety measure with target kl_divergence, that will ensure even smoother learning and enforce early stopping to prevent learning collapses.
-
Nvidia ISAAC gym/RL
Code for https://arxiv.org/abs/1707.06347 found: https://github.com/hill-a/stable-baselines
- Bounds for observation
-
Understanding multi agent learning in OpenAI gym and stable-baselines
I haven't read the code, but stable-baselines doesn't support multi-agent environments (https://github.com/hill-a/stable-baselines/issues/423), so I think they're trying to make learning multi-agent easier with Environment.train().
- Using Reinforment Learning to beat the first boss in Dark souls 3 with Proximal Policy Optimization
-
Reinforcement Learning Crash Course (Free)
- https://github.com/hill-a/stable-baselines (Tensorflow)
-
JAX Implementations of Actor-Critic Algorithms
- tf2 speed: https://github.com/hill-a/stable-baselines/issues/576#issuecomment-573331715