-
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I followed the TensorFlow tutorial for agents and the multi armed bandit tutorial and now I'm trying to make one of the already implemented agents, from the examples, work on my own environment. Basically my environment exists of 5 actions and 5 observations. Applying one action i results in the same state i. One action contains another step of sending that action number to a different program via a socket and the answer from the program is interpreted for the reward. My environment seems to be working, I used the little test script below to test the observe and action functions. I know this is not a full proof but showed its atleast working.
Related posts
-
Competitive reinforcement learning for turn-based games
-
a3c_trading: NEW Deep Learning And Reinforcement Learning - star count:392.0
-
a3c_trading: NEW Deep Learning And Reinforcement Learning - star count:392.0
-
a3c_trading: NEW Deep Learning And Reinforcement Learning - star count:392.0
-
a3c_trading: NEW Deep Learning And Reinforcement Learning - star count:392.0