Self-learning of the robot in 1 hour

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

Proximal-Policy-Optimization

2 1 4.1 Jupyter Notebook

An implementation of Proximal Policy Optimization

In simpler setups (such as simulating a walking ant in MuJoCo), you can feasibly get away with a reward as simple as giving the agent positive reward for moving towards some goal, giving the agent a small, negative reward for not making any forward progress, and giving the agent a large, negative reward for moving away from the goal. The agent simply knows (a) the current angles of it's joints (which it can apply force to) and (b) it's current position relative to the goal. Through a lot of training with these simple rules, the agent can learn to walk towards the goal. Note that it doesn't explicitly learn to walk, it just figures out how to actuate it's joints to move towards the goal as quickly as possible, which, as it turns out, is walking (or, in the case of the example GIF I linked to, more like skipping).

daydreamer

4 220 10.0 Jupyter Notebook

DayDreamer: World Models for Physical Robot Learning

Just saw that our video was posted here. For people interested in the research, here is the project website with the research paper: https://danijar.com/daydreamer

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Reinforcement learning or computer vision

1 project | /r/reinforcementlearning | 11 Mar 2023
Mastering Diverse Domains through World Models - DreamerV3 - Deepmind 2023 - First algorithm to collect diamonds in Minecraft from scratch without human data or curricula! Now with github links!

2 projects | /r/reinforcementlearning | 21 Feb 2023
Sources of Actor Gradients

1 project | /r/reinforcementlearning | 21 Nov 2022
PyDreamer: model-based RL written in PyTorch + integrations with DM Lab and MineRL environments

4 projects | /r/reinforcementlearning | 26 Nov 2021
Google AI, DeepMind And The University of Toronto Introduce DreamerV2, The First Reinforcement Learning (RL) Agent That Outperforms Humans on The Atari Benchmark

1 project | /r/artificial | 23 Feb 2021

This page summarizes the projects mentioned and recommended in the original post on /r/ChatGPT
reinforcement-learning Robotics world-models
Post date: 6 Jun 2023

Proximal-Policy-Optimization

daydreamer

InfluxDB

Related posts

Reinforcement learning or computer vision

Mastering Diverse Domains through World Models - DreamerV3 - Deepmind 2023 - First algorithm to collect diamonds in Minecraft from scratch without human data or curricula! Now with github links!

Sources of Actor Gradients

PyDreamer: model-based RL written in PyTorch + integrations with DM Lab and MineRL environments

Google AI, DeepMind And The University of Toronto Introduce DreamerV2, The First Reinforcement Learning (RL) Agent That Outperforms Humans on The Atari Benchmark