policy-adaptation-during-deployment
dmc2gymnasium
policy-adaptation-during-deployment | dmc2gymnasium | |
---|---|---|
1 | 1 | |
109 | 4 | |
- | - | |
1.8 | 3.4 | |
over 3 years ago | 19 days ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
policy-adaptation-during-deployment
-
Exploring Self-Supervised Policy Adaptation To Continue Training After Deployment Without Using Any Rewards
Code: https://github.com/nicklashansen/policy-adaptation-during-deployment
dmc2gymnasium
-
DM Control Suite vs. Original Environments
I'm actually working on a DMC to gymnasium wrapper right now that you might find useful https://github.com/imgeorgiev/dmc2gymnasium
What are some alternatives?
Ne2Ne-Image-Denoising - Deep Unsupervised Image Denoising, based on Neighbour2Neighbour training
gym-simplegrid - Simple Gridworld Gymnasium Environment
envpool - C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
Gym-Trading-Env - A simple, easy, customizable Gymnasium environment for trading.
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
football - Check out the new game server:
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
pysc2 - StarCraft II Learning Environment
drl_grasping - Deep Reinforcement Learning for Robotic Grasping from Octrees
es_pytorch - High performance implementation of Deep neuroevolution in pytorch using mpi4py. Intended for use on HPC clusters
holodeck - High Fidelity Simulator for Reinforcement Learning and Robotics Research.