rlalgorithms-tf2 VS DeepRL-TensorFlow2

Compare rlalgorithms-tf2 vs DeepRL-TensorFlow2 and see what are their differences.

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
rlalgorithms-tf2 DeepRL-TensorFlow2
18 2
45 573
- -
4.7 0.0
almost 2 years ago almost 2 years ago
Python Python
MIT License Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

rlalgorithms-tf2

Posts with mentions or reviews of rlalgorithms-tf2. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-09-06.

DeepRL-TensorFlow2

Posts with mentions or reviews of DeepRL-TensorFlow2. We have used some of these posts to build our list of alternatives and similar projects.
  • PPO implementation in TensorFlow2
    1 project | /r/reinforcementlearning | 12 Sep 2021
    I've been searching for a clean, good, and understandable implementation of PPO for continuous action space with TF2 witch is understandable enough for me to apply my modifications, but the closest thing that I have found is this code which seems to not work properly even on a simple gym cartpole env (discussed issues in git-hub repo suggest the same problem) so I have some doubts :). I was wondering whether you could recommend an implementation that you trust and suggest :)
  • Question about using tf.stop_gradient in separate Actor-Critic networks for A2C implementation for TF2
    1 project | /r/reinforcementlearning | 24 Mar 2021
    I have been looking at this implementation of A2C. Here the author of the code uses stop_gradient only on the critic network at L90 bur not in the actor network L61 for the continuous case. However , it is used both in actor and critic networks for the discrete case. Can someone explain me why?

What are some alternatives?

When comparing rlalgorithms-tf2 and DeepRL-TensorFlow2 you can also consider the following projects:

stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

soft-actor-critic - Re-implementation of Soft-Actor-Critic (SAC) in TensorFlow 2.0

IRL - Algorithms for Inverse Reinforcement Learning

tensorforce - Tensorforce: a TensorFlow library for applied reinforcement learning

TensorLayer - Deep Learning and Reinforcement Learning Library for Scientists and Engineers

TensorFlow2.0-for-Deep-Reinforcement-Learning - TensorFlow 2.0 for Deep Reinforcement Learning. :octopus:

tf2multiagentrl - Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x

ydata-synthetic - Synthetic data generators for tabular and time-series data

DRL-robot-navigation - Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

minimalRL - Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

agents - TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

machin - Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...