Metric | hora | reinforcement-learning-an-introduction
---|---|---
Mentions | 1 | 2
Stars | 92 | 13,261
Growth | - | -
Activity | 4.5 | 2.7
Last commit | 16 days ago | 2 months ago
Language | Python | Python
License | GNU General Public License v3.0 or later | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
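The site does not publish its exact formula, so the following is only a minimal sketch of how a score with these two properties (recent commits weigh more, and 9.0 means top 10%) could be computed. The exponential decay, the `half_life_days` parameter, and both function names are assumptions made for illustration, not the tracker's actual method.

```python
from bisect import bisect_left


def recency_weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum commit weights, halving a commit's weight every `half_life_days`.

    Hypothetical weighting: the real decay curve is not documented.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_scores(raw_activity):
    """Map each project's raw activity to a 0-10 score by percentile rank.

    A project that beats 90% of tracked projects gets 9.0, which matches
    the "top 10%" reading of an activity of 9.0.
    """
    ordered = sorted(raw_activity.values())
    total = len(ordered)
    return {
        name: round(10 * bisect_left(ordered, value) / total, 1)
        for name, value in raw_activity.items()
    }


# Usage: ten hypothetical projects with distinct raw activity values.
raw = {f"project{i}": float(i) for i in range(10)}
scores = activity_scores(raw)
```

Because the score is a rank rather than a raw count, it only has meaning relative to the other tracked projects, which is why the page calls it a relative number.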
hora
- Latest Robotics Research Releases ‘Hora’: A Single Policy Capable of Rotating Diverse Objects With a Dexterous Robot Hand
  Quick Read: https://www.marktechpost.com/2022/10/17/latest-robotics-research-releases-hora-a-single-policy-capable-of-rotating-diverse-objects-with-a-dexterous-robot-hand/
  Paper: https://arxiv.org/pdf/2210.04887.pdf
  GitHub: https://github.com/HaozhiQi/hora/
  Project: https://haozhi.io/hora/
reinforcement-learning-an-introduction
- Help request: Are the results of Sutton and Barto's Example 6.6 Cliff walking believable? What's likely the problem if my SARSA implementation can't replicate?
  The Python code that generates every figure in this textbook is available in a repo, and the file for the figure in question is here: https://github.com/ShangtongZhang/reinforcement-learning-an-introduction/blob/master/chapter06/cliff_walking.py
- Reinforcement Learning - looking for some resources
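For readers with the same Cliff Walking question, here is a minimal, self-contained SARSA sketch to compare an implementation against. It is not the code from the linked repo. The grid layout, the rewards of -1 per step and -100 for the cliff, and ε = 0.1 follow Example 6.6; the step size `alpha=0.5`, the random tie-breaking, the seed, and all function names are assumptions chosen for illustration.

```python
import random

ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply one move; stepping into the cliff costs -100 and resets to START."""
    row = min(max(state[0] + action[0], 0), ROWS - 1)
    col = min(max(state[1] + action[1], 0), COLS - 1)
    if row == 3 and 1 <= col <= 10:
        return START, -100
    return (row, col), -1


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    best = max(q[state])
    return rng.choice([a for a, v in enumerate(q[state]) if v == best])


def sarsa(episodes=500, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Run tabular SARSA and return the Q-table and the per-episode returns."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    returns = []
    for _ in range(episodes):
        state = START
        action = epsilon_greedy(q, state, epsilon, rng)
        total = 0
        while state != GOAL:
            next_state, reward = step(state, ACTIONS[action])
            # On-policy target: bootstrap from the action actually taken next.
            next_action = epsilon_greedy(q, next_state, epsilon, rng)
            target = reward + gamma * q[next_state][next_action]
            q[state][action] += alpha * (target - q[state][action])
            state, action = next_state, next_action
            total += reward
        returns.append(total)
    return q, returns


q_table, episode_returns = sarsa(episodes=50)
```

Two things worth checking when results do not match: bootstrapping from the maximum over next actions turns the update into Q-learning rather than SARSA, and forgetting to send the agent back to the start after a cliff step changes the returns.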
What are some alternatives?
tensor2tensor - Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Reinforcement-Learning-Notebooks - Single notebook implementation of Deep RL algorithms
Carla_The_RL_Self-Driving-Car - Carla_The_RL_Self-Driving Car
reinforcement-learning - Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
PaLM-rlhf-pytorch - Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
TensorLayer - Deep Learning and Reinforcement Learning Library for Scientists and Engineers
dm_control - Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.