chainerrl
Deep-Reinforcement-Learning-Hands-On
Our great sponsors
chainerrl | Deep-Reinforcement-Learning-Hands-On | |
---|---|---|
3 | 2 | |
1,141 | 2,746 | |
0.0% | 0.7% | |
0.0 | 0.0 | |
over 2 years ago | about 1 year ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
chainerrl
-
Help with my PyTorch implementation of PPO
Code for https://arxiv.org/abs/1709.06560 found: https://github.com/chainer/chainerrl
-
Any working Acer implementation for continuous action space?
I implemented my version of Acer that supports discrete action space. I need to add an extension that supports continuous action space. I've seen a couple of implementations here and here. The first doesn't work for PongNoFrameskip-v4 and the other doesn't work in macOS.
-
Beginner attempting to implement Noisy DQN
I tried all the versions I found and in most of them the network couldn't even learn to set the sigma as 0 (or close). The only implementation where I actually got improvement was by changing the noise directly when calling the noisy layers in this git. I don't know if this is the correct way but it sure showed good results.
Deep-Reinforcement-Learning-Hands-On
-
A2C/PPO with continuous action space
In some methods, like the one here, the actor network has two heads, one for the mean and one for the variance. In other methods, like the one here, the network only outputs the mean, while the variance is pre-defined and is decaying throughout the training.
-
Beginner attempting to implement Noisy DQN
https://github.com/PacktPublishing/Deep-Reinforcement-Learning-Hands-On/blob/baa9d013596ea8ea8ed6826b9de6679d98b897ca/Chapter07/lib/dqn_model.py#L9
What are some alternatives?
TensorLayer - Deep Learning and Reinforcement Learning Library for Scientists and Engineers
PPO-PyTorch - Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
machin - Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
TensorFlow2.0-for-Deep-Reinforcement-Learning - TensorFlow 2.0 for Deep Reinforcement Learning. :octopus:
DeepRL-TensorFlow2 - 🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
deep-q-learning - Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
DeepLearning - Contains all my works, references for deep learning
fundamentalRL - educational codebase demonstrating some of the most common RL algorithms
acer - PyTorch implementation of both discrete and continuous ACER