gym-battleship
rl-baselines-zoo
Our great sponsors
gym-battleship | rl-baselines-zoo | |
---|---|---|
2 | 2 | |
9 | 1,106 | |
- | - | |
2.4 | 0.0 | |
about 1 year ago | over 1 year ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gym-battleship
-
Battleship
There are a few gyms for battleship, https://github.com/thomashirtz/gym-battleship ... some variations I would like to see involve a much bigger grid (100x100) and moving ships along axis @ various rates ... but then you would get a different game I guess.
-
Python OpenAI Gym environment for reinforcement learning
(shamless plug) If you want to check out a simple custom gym environment of a boardgame, I did a battleship env some time ago: https://github.com/thomashirtz/gym-battleship
rl-baselines-zoo
-
Agent trains great with PPO but terrible with SAC --> Advice for Hyperparameters
Take a look at these tuned sets of hyperparameters for various problems in PPO and SAC. The batch sizes are WAY smaller regardless of the problem. Your initial learning rate may also be too high.
-
How do I convert zoo / gym trained models to TensorFlow Lite or PyTorch TorchScript?
https://github.com/araffin/rl-baselines-zoo (TensorFlow based, using https://github.com/hill-a/stable-baselines)
What are some alternatives?
open_spiel - OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
rl-baselines3-zoo - A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Minigrid - Simple and easily configurable grid world environments for reinforcement learning
rex-gym - OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
seed_rl - SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
pybullet-gym - Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.
portfolio-management - Repository for portfolio management using Pytorch, SQLAlchemy and XArray. The management is done using the reinforcement learning algorithm "Soft Actor-Critic".
pytorch-blender - :sweat_drops: Seamless, distributed, real-time integration of Blender into PyTorch data pipelines
ideas - :rocket: Ideas for everyone under a CC licence. Feel free to use. I'll send you a postcard if you build anything on this list.