| | reinforcement-learning-an-introduction | hora |
|---|---|---|
| Mentions | 2 | 1 |
| Stars | 13,229 | 88 |
| Growth | - | - |
| Activity | 2.7 | 6.1 |
| Last commit | about 1 month ago | 5 months ago |
| Language | Python | Python |
| License | MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
reinforcement-learning-an-introduction
- Help request: Are the results of Sutton and Barto's Example 6.6 Cliff walking believable? What's likely the problem if my SARSA implementation can't replicate?
  The python code to generate any figure in this textbook is reproduced in a repo, and you can find the file for the figure in question here: https://github.com/ShangtongZhang/reinforcement-learning-an-introduction/blob/master/chapter06/cliff_walking.py
- Reinforcement Learning - looking for some resources
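
For readers following the cliff-walking question above, here is a minimal tabular SARSA sketch to compare an implementation against. It is not taken from the linked repository. The 4x12 grid layout, the rewards (-1 per step, -100 for the cliff with a reset to the start), and the defaults `alpha=0.5`, `gamma=1.0`, `epsilon=0.1` are assumptions based on the usual statement of the example, and the helper names (`step`, `choose_action`, `sarsa`) are hypothetical.

```python
import random

ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Return (next_state, reward) for one move on the assumed grid."""
    row = min(max(state[0] + ACTIONS[action][0], 0), ROWS - 1)
    col = min(max(state[1] + ACTIONS[action][1], 0), COLS - 1)
    if row == ROWS - 1 and 0 < col < COLS - 1:  # stepped into the cliff
        return START, -100
    return (row, col), -1


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    best = max(q[state])
    return rng.choice([a for a, v in enumerate(q[state]) if v == best])


def sarsa(episodes=500, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Run tabular SARSA and return (q_table, per_episode_returns)."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    returns = []
    for _ in range(episodes):
        state = START
        action = choose_action(q, state, epsilon, rng)
        total = 0
        while state != GOAL:
            next_state, reward = step(state, action)
            next_action = choose_action(q, next_state, epsilon, rng)
            # On-policy target: bootstrap from the action actually chosen next.
            target = reward + gamma * q[next_state][next_action]
            q[state][action] += alpha * (target - q[state][action])
            state, action = next_state, next_action
            total += reward
        returns.append(total)
    return q, returns


if __name__ == "__main__":
    _, episode_returns = sarsa()
    print(sum(episode_returns[-100:]) / 100)  # mean return, last 100 episodes
```

Two details commonly cause mismatches: selecting the next action before the update (so the target uses that same action), and leaving the terminal state's values at zero. Single-run reward curves are noisy, so averaging over several seeds before comparing with a published plot is usually worthwhile.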
hora
- Latest Robotics Research Releases ‘Hora’: A Single Policy Capable of Rotating Diverse Objects With a Dexterous Robot Hand
  Quick Read: https://www.marktechpost.com/2022/10/17/latest-robotics-research-releases-hora-a-single-policy-capable-of-rotating-diverse-objects-with-a-dexterous-robot-hand/
  Paper: https://arxiv.org/pdf/2210.04887.pdf
  GitHub: https://github.com/HaozhiQi/hora/
  Project: https://haozhi.io/hora/
What are some alternatives?
Reinforcement-Learning-Notebooks - Single notebook implementation of Deep RL algorithms
tensor2tensor - Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Carla_The_RL_Self-Driving-Car - Carla_The_RL_Self-Driving Car
reinforcement-learning - Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
PaLM-rlhf-pytorch - Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
TensorLayer - Deep Learning and Reinforcement Learning Library for Scientists and Engineers
dm_control - Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.