rl-baselines-zoo
gym-battleship
rl-baselines-zoo | gym-battleship | |
---|---|---|
2 | 2 | |
1,106 | 9 | |
- | - | |
0.0 | 2.4 | |
over 1 year ago | about 1 year ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
rl-baselines-zoo
-
Agent trains great with PPO but terrible with SAC --> Advice for Hyperparameters
Take a look at these tuned sets of hyperparameters for various problems in PPO and SAC. The batch sizes are WAY smaller regardless of the problem. Your initial learning rate may also be too high.
-
How do I convert zoo / gym trained models to TensorFlow Lite or PyTorch TorchScript?
https://github.com/araffin/rl-baselines-zoo (TensorFlow based, using https://github.com/hill-a/stable-baselines)
gym-battleship
-
Battleship
There are a few gyms for battleship, https://github.com/thomashirtz/gym-battleship ... some variations I would like to see involve a much bigger grid (100x100) and moving ships along axis @ various rates ... but then you would get a different game I guess.
-
Python OpenAI Gym environment for reinforcement learning
(shamless plug) If you want to check out a simple custom gym environment of a boardgame, I did a battleship env some time ago: https://github.com/thomashirtz/gym-battleship
What are some alternatives?
rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Minigrid - Simple and easily configurable grid world environments for reinforcement learning
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
rex-gym - OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
pybullet-gym - Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
pytorch-blender - :sweat_drops: Seamless, distributed, real-time integration of Blender into PyTorch data pipelines
portfolio-management - Repository for portfolio management using Pytorch, SQLAlchemy and XArray. The management is done using the reinforcement learning algorithm "Soft Actor-Critic".
ideas - :rocket: Ideas for everyone under a CC licence. Feel free to use. I'll send you a postcard if you build anything on this list.