Our great sponsors
-
rlai
This is a Python implementation of concepts and algorithms described in "Reinforcement Learning: An Introduction" (Sutton and Barto, 2018, 2nd edition).
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Coursera_Reinforcement_Learning
Coursera Reinforcement Learning Specialization by University of Alberta & Alberta Machine Intelligence Institute
I've worked through several OpenAI Gym environments with my RL library, which is based almost entirely on the RL textbook by Sutton and Barto (case studies here). No neural networks, nothing too fancy. But I do explore JAX for policy gradient methods / continuous control.
I meant State Of The Art (SOTA) ;) Look here for a simple implementation of Expected Sarsa ( you can also find sarsa on github) https://github.com/LucasBoTang/Coursera_Reinforcement_Learning/blob/master/02Sample-based_Learning_Methods/02Q-Learning_and_Expected_Sarsa.ipynb
Related posts
- [P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!
- TransformerXL + PPO Baseline + MemoryGym
- How does advantage estimation is done when episodes are of variable length in PPO?
- Trying to Train PPO Agent for Pendulum-v0 from Pixel Inputs
- How do I change the maximum number of steps for training