- SAC: Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. (by alirezakazemipour)
It's been 3 weeks since I started learning reinforcement learning, and I am stuck on one problem. I am using the soft actor-critic (SAC) method from this implementation on GitHub. I made some changes to the sample_or_likelihood function in the model part. Since I am using a custom environment that has continuous actions with a different bound for each action, I changed the part that clamps with max so that it follows my custom action bounds.
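For readers with the same problem, one common way to handle different bounds per action is to rescale the tanh-squashed sample into each dimension's range instead of clamping, and to include that rescaling in the log-probability. The following is a minimal sketch in plain Python, not the code from the linked repository. The names `gaussian_log_prob` and `sample_scaled_action`, and the parameters `lows`, `highs`, and `eps`, are hypothetical and chosen only for illustration.

```python
import math
import random


def gaussian_log_prob(u, mean, std):
    """Log-density of a 1-D Gaussian N(mean, std^2) evaluated at u."""
    return (-0.5 * ((u - mean) / std) ** 2
            - math.log(std)
            - 0.5 * math.log(2.0 * math.pi))


def sample_scaled_action(means, stds, lows, highs, rng=random, eps=1e-6):
    """Sample one action whose i-th component lies in [lows[i], highs[i]].

    Hypothetical helper: each dimension is sampled from a Gaussian,
    squashed with tanh into (-1, 1), then mapped affinely to its own bounds.
    Returns the action list and the log-probability of that action.
    """
    actions = []
    log_prob = 0.0
    for mean, std, low, high in zip(means, stds, lows, highs):
        u = rng.gauss(mean, std)           # pre-squash Gaussian sample
        t = math.tanh(u)                   # squashed into (-1, 1)
        half_range = 0.5 * (high - low)
        action = low + (t + 1.0) * half_range
        # Change of variables: da/du = half_range * (1 - tanh(u)^2).
        # eps keeps the logarithm finite when tanh saturates.
        log_prob += gaussian_log_prob(u, mean, std)
        log_prob -= math.log(half_range * (1.0 - t * t) + eps)
        actions.append(action)
    return actions, log_prob


# Example with three action dimensions that have different bounds (assumed values).
rng = random.Random(0)
lows, highs = [-1.0, 0.0, -5.0], [1.0, 10.0, 5.0]
action, log_prob = sample_scaled_action([0.0] * 3, [1.0] * 3, lows, highs, rng=rng)
```

With this mapping every component stays inside its own range by construction, so no clamp is needed. A clamp applied after sampling changes the action without changing the log-probability, which can make the entropy term in SAC inconsistent with the actions the environment actually receives.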
Related posts
- Does “massively parallel simulation” help advance Reinforcement Learning?
- ElegantRL: Cloud-Native Deep Reinforcement Learning
- Implementation of SAC in custom environment
- TIL that in 1986 an astronomer trying to trace a 75 cent computer time discrepancy for 10 months eventually found a German hacker selling defense secrets to the KGB
- Help on what could be wrong on my TD3?