-
DeepRL-TensorFlow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I have been looking at this implementation of A2C. Here the author of the code uses stop_gradient only on the critic network at L90 bur not in the actor network L61 for the continuous case. However , it is used both in actor and critic networks for the discrete case. Can someone explain me why?
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
PPO implementation in TensorFlow2
-
Probabilistic forecasting
-
How can we model an observation space of an env with different features and sizes.
-
[D] Simple model-based RL exercise for master students.
-
tf-agents throws ValueError: Layer dense layer expects 1 input(s), but it received 4 input tensors when using custom environment with OpenAI Gym