pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Try using an imitation learning algorithm. Two popular options are MaxEnt IRL and GAIL. This repository has a GAIL implementation, and this repository has both MaxEnt IRL and GAIL implementations. There are other implementations that you can check out, too.
Related posts
- Exploring Self-Supervised Policy Adaptation To Continue Training After Deployment Without Using Any Rewards
- [P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- TransformerXL + PPO Baseline + MemoryGym
- What's the best "Non-Black Box" framework for SOTA algorithms?
- Try simple interfaces and customized driving policy and casezoo set on DI-drive!