DD-PPO, TD3, SAC: which is the best?

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • habitat-api

    Discontinued A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators. [Moved to: https://github.com/facebookresearch/habitat-lab]

  • Code for https://arxiv.org/abs/1911.00357 found: https://github.com/facebookresearch/habitat-api

  • pbt

    Experiments with DeepMind's Population Based Training

  • Code for https://arxiv.org/abs/1711.09846 found: https://github.com/cipher813/pbt

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • agents

    TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

  • Depending on what task you pick the "best" algo will vary. There are also a bunch of variations and tricks for each of those, some of which have been given new names over time. If you are working on a project I would suggest whichever one has the simplest and most extendable implementation. If you really want to compare all of them you can use libraries that have them all implemented, such as tfagents.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts