es_pytorch
Super-mario-bros-PPO-pytorch
Our great sponsors
es_pytorch | Super-mario-bros-PPO-pytorch | |
---|---|---|
1 | 1 | |
23 | 970 | |
- | - | |
0.0 | 0.0 | |
over 2 years ago | almost 3 years ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
es_pytorch
-
What is the greatest achievement of Genetic Algorithms[D]?
ES, specifically OpenAI's ES (and to an extent CMA-ES). This has been shown to be very competitive with modern state of the art RL algorithms. A huge benefit of it is that it's incredibly easy to implement (I'm gonna shamelessly plug my implementation if you want to see the inner workings)
Super-mario-bros-PPO-pytorch
-
[AI application] AI agent plays Contra
if you are interested in Super mario bros, here you are https://github.com/uvipen/Super-mario-bros-PPO-pytorch
What are some alternatives?
muzero-general - MuZero
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
pureples - Pure Python Library for ES-HyperNEAT. Contains implementations of HyperNEAT and ES-HyperNEAT.
neat-python - Python implementation of the NEAT neuroevolution algorithm
pytorch-learn-reinforcement-learning - A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
policy-adaptation-during-deployment - Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
pybullet-gym - Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.
chess - Program for playing chess in the console against AI or human opponents
discord-openai-bot - A Discord chatbot that uses OpenAI's API to generate conversation.