Question about using tf.stop_gradient in separate Actor-Critic networks for A2C implementation for TF2

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

DeepRL-TensorFlow2

2 573 0.0 Python

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

I have been looking at this implementation of A2C. Here the author of the code uses stop_gradient only on the critic network at L90 bur not in the actor network L61 for the continuous case. However , it is used both in actor and critic networks for the discrete case. Can someone explain me why?

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

PPO implementation in TensorFlow2

1 project | /r/reinforcementlearning | 12 Sep 2021
Probabilistic forecasting

1 project | /r/MLQuestions | 24 Apr 2023
How can we model an observation space of an env with different features and sizes.

2 projects | /r/reinforcementlearning | 20 Dec 2022
[D] Simple model-based RL exercise for master students.

1 project | /r/reinforcementlearning | 22 Nov 2021
tf-agents throws ValueError: Layer dense layer expects 1 input(s), but it received 4 input tensors when using custom environment with OpenAI Gym

3 projects | /r/learnmachinelearning | 7 Jul 2021

Question about using tf.stop_gradient in separate Actor-Critic networks for A2C implementation for TF2

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning
Tensorflow Machine Learning reinforcement-learning A2c a3c
Post date: 24 Mar 2021

DeepRL-TensorFlow2

InfluxDB

Related posts

PPO implementation in TensorFlow2

Probabilistic forecasting

How can we model an observation space of an env with different features and sizes.

[D] Simple model-based RL exercise for master students.

tf-agents throws ValueError: Layer dense layer expects 1 input(s), but it received 4 input tensors when using custom environment with OpenAI Gym

Question about using tf.stop_gradient in separate Actor-Critic networks for A2C implementation for TF2

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning Tensorflow Machine Learning reinforcement-learning A2c a3c Post date: 24 Mar 2021

DeepRL-TensorFlow2

InfluxDB

Related posts

PPO implementation in TensorFlow2

Probabilistic forecasting

How can we model an observation space of an env with different features and sizes.

[D] Simple model-based RL exercise for master students.

tf-agents throws ValueError: Layer dense layer expects 1 input(s), but it received 4 input tensors when using custom environment with OpenAI Gym

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning
Tensorflow Machine Learning reinforcement-learning A2c a3c
Post date: 24 Mar 2021