A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Why do you think that https://github.com/uvipen/Tetris-deep-Q-learning-pytorch is a good alternative to pytorch-learn-reinforcement-learning