policy-adaptation-during-deployment vs Super-mario-bros-PPO-pytorch
Metric | policy-adaptation-during-deployment | Super-mario-bros-PPO-pytorch
---|---|---
Mentions | 1 | 1
Stars | 109 | 970
Growth | - | -
Activity | 1.8 | 0.0
Last commit | over 3 years ago | almost 3 years ago
Language | Python | Python
License | - | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
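The page does not publish the formula behind the activity number, so the following is only a minimal sketch of how a recency-weighted, relative score of this kind could be computed. The function names, the exponential half-life weighting, and the 30-day default are all assumptions made for illustration; only the two stated properties (recent commits weigh more, and 9.0 corresponds to the top 10% of tracked projects) come from the description above.

```python
from bisect import bisect_left


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Raw score: each commit contributes a weight that halves every
    `half_life_days`, so recent commits count more than older ones.
    (Hypothetical weighting, not the site's actual formula.)"""
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def relative_activity(score, all_scores):
    """Map a raw score to a 0-10 scale by percentile rank: the result is
    10 times the fraction of tracked projects with a strictly lower score."""
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # projects scoring strictly lower
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    # Commits made today, 30 days ago, and 60 days ago.
    print(weighted_commit_score([0, 30, 60]))
    # A project scoring higher than 9 of 10 tracked projects.
    print(relative_activity(9, list(range(10))))
```

Under this sketch, a project that outscores 90% of the tracked set receives 9.0, which matches the "top 10%" reading of the example above.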
policy-adaptation-during-deployment
Exploring Self-Supervised Policy Adaptation To Continue Training After Deployment Without Using Any Rewards
Code: https://github.com/nicklashansen/policy-adaptation-during-deployment
Super-mario-bros-PPO-pytorch
[AI application] AI agent plays Contra
If you are interested in Super Mario Bros, see https://github.com/uvipen/Super-mario-bros-PPO-pytorch
What are some alternatives?
Ne2Ne-Image-Denoising - Deep Unsupervised Image Denoising, based on Neighbour2Neighbour training
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
envpool - C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
muzero-general - MuZero
pytorch-learn-reinforcement-learning - A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
drl_grasping - Deep Reinforcement Learning for Robotic Grasping from Octrees
dmc2gymnasium - Gymnasium integration for the DeepMind Control (DMC) suite
pybullet-gym - Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.