Meta-SAC

Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 (by twni2016)

Meta-SAC Alternatives

Similar projects and alternatives to Meta-SAC based on common topics and language

  • f-IRL

    Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020

  • autonomous-learning-library

    A PyTorch library for building deep reinforcement learning agents.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • tianshou

    An elegant PyTorch deep reinforcement learning library.

  • pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • AgileRL

    Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better Meta-SAC alternative or higher similarity.

Meta-SAC reviews and mentions

Posts with mentions or reviews of Meta-SAC. We have used some of these posts to build our list of alternatives and similar projects.
  • Do policy gradient methods also require some mechanism for exploration?
    1 project | /r/reinforcementlearning | 1 Apr 2022
    A simple approach that can help is a linear entropy schedule: Start at a high value to explore early, decay over time to learn a more optimal policy. Some variants of SAC autotune the entropy over time. A more advanced approach is AGAC, which does something like a GAN to encourage the PPO/A2C policy to explore by forcing it to be less predictable. There are many approaches, these are just a sample

Stats

Basic Meta-SAC repo stats
1
28
0.0
almost 3 years ago

twni2016/Meta-SAC is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of Meta-SAC is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com