Our great sponsors
-
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
1- Hyperparameter optimization as already suggested by u/sener87 but I think your validation does not have to be change as it tests generalization as far as I understand you right. If you have more parameter/larger search space, you may look into Bayesian optimization for this task as implemented e.g. with tensorflow, torch or numpy frameworks.
2- You could go the reinforcement learning approach by controlling these parameters using an agent. This would mean that the parameters would have to change on the fly, which I am not sure if appropriate. If so, creating a gym environment is not so hard, which would then use something like tf.agents , rlax or any other rl framework of your liking.
2- You could go the reinforcement learning approach by controlling these parameters using an agent. This would mean that the parameters would have to change on the fly, which I am not sure if appropriate. If so, creating a gym environment is not so hard, which would then use something like tf.agents , rlax or any other rl framework of your liking.
Related posts
- Probabilistic forecasting
- How can we model an observation space of an env with different features and sizes.
- [D] Simple model-based RL exercise for master students.
- PPO implementation in TensorFlow2
- tf-agents throws ValueError: Layer dense layer expects 1 input(s), but it received 4 input tensors when using custom environment with OpenAI Gym