A simple implementation of "Adaptive Policy Iteration" using Google's JAX and Deepmind "bsuite". This approximate policy iteration scheme treats the value-function as losses. (arXiv:2002.03069)

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

adaptive-policy-iteration

2 5 2.6 Python

JAX implementation of Adaptive Approximate Policy Iteration (Hao et al., 2021)
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

A simple implementation of "Adaptive Policy Iteration" using Google's JAX and Deepmind "bsuite". This approximate policy iteration scheme treats the value-function as losses. (arXiv:2002.03069)

1 project | /r/reinforcementlearning | 8 Jun 2021
About Monte Carlo tree search in Jax

1 project | news.ycombinator.com | 23 Nov 2023
4000x Speedup in Reinforcement Learning with Jax

1 project | news.ycombinator.com | 7 Apr 2023
Physic engine for 3D simulation: which one to use?

1 project | /r/reinforcementlearning | 8 Oct 2022
Brax vs TDS for differentiable rigid body dynamics

2 projects | /r/robotics | 11 Sep 2022

A simple implementation of "Adaptive Policy Iteration" using Google's JAX and Deepmind "bsuite". This approximate policy iteration scheme treats the value-function as losses. (arXiv:2002.03069)

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning
reinforcement-learning adaptive-policy-iteration Jax bsuite efficient-exploration
Post date: 8 Jun 2021

adaptive-policy-iteration

InfluxDB

Related posts