open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
I've had some success using action masking. For a reference, see OpenSpiel's DQN implementation: https://github.com/deepmind/open_spiel/blob/120420a74a69354d64c10b51cd129d4587f9f325/open_spiel/python/algorithms/dqn.py. For DQN you need to mask out the Q-values of invalid actions during training (so the max over next-state actions only considers valid ones), as well as masking them during prediction. In my case I can put the mask in the observation, so it's easy to fetch during prediction. If that's not possible, you could query the mask from the environment and store it in the replay buffer, as the linked code does.
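To make the two masking steps concrete, here is a minimal NumPy sketch. It is not the OpenSpiel code, just an illustration of the idea: the function names, the `ILLEGAL_PENALTY` constant, and the array shapes are my own assumptions, and the mask is assumed to be 1 for valid actions and 0 for invalid ones.

```python
import numpy as np

# Hypothetical constant: a value low enough that a masked action never wins a max/argmax.
ILLEGAL_PENALTY = -1e9


def masked_greedy_action(q_values, legal_mask):
    """Pick the greedy action among valid actions only (prediction-time masking).

    q_values:   shape (num_actions,), Q-value estimates for one state.
    legal_mask: shape (num_actions,), 1 for valid actions, 0 for invalid ones.
    """
    masked_q = np.where(legal_mask.astype(bool), q_values, ILLEGAL_PENALTY)
    return int(np.argmax(masked_q))


def masked_td_targets(rewards, next_q_values, next_legal_masks, dones, gamma=0.99):
    """Compute DQN targets using the max over valid next actions only (training-time masking).

    rewards:          shape (batch,)
    next_q_values:    shape (batch, num_actions), target-network Q-values for next states.
    next_legal_masks: shape (batch, num_actions), masks sampled from the replay buffer.
    dones:            shape (batch,), 1.0 where the transition ended the episode.
    """
    masked_next_q = np.where(next_legal_masks.astype(bool), next_q_values, ILLEGAL_PENALTY)
    max_next_q = masked_next_q.max(axis=1)
    # Terminal transitions contribute no bootstrap term, so their mask does not matter.
    return rewards + gamma * (1.0 - dones) * max_next_q
```

The point of storing `next_legal_masks` alongside each transition is that the training step no longer needs access to the environment: the mask for the next state is sampled from the buffer together with the rest of the transition.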