chappie.ai
Super-mario-bros-PPO-pytorch
Our great sponsors
chappie.ai | Super-mario-bros-PPO-pytorch | |
---|---|---|
4 | 1 | |
20 | 950 | |
- | - | |
1.3 | 0.0 | |
6 months ago | over 2 years ago | |
Jupyter Notebook | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
chappie.ai
- Playing Chess By Combining MuZero and Perceiver IO
- [D] A chess game
-
Multi-task learning: How's that done?
I use the second approach in a MuZero like bot if you are interested in an example with code https://medium.com/mlearning-ai/playing-chess-with-a-generalized-ai-b83d64ac71fe. The code can be found here: https://github.com/bellerb/chappie.ai/blob/main/ai/bot.py.
-
Teaching A Generalized AI Chess
full code link: https://github.com/bellerb/chappie.ai
Super-mario-bros-PPO-pytorch
-
[AI application] AI agent plays Contra
if you are interested in Super mario bros, here you are https://github.com/uvipen/Super-mario-bros-PPO-pytorch
What are some alternatives?
quickai - QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
OpenPrompt - An Open-Source Framework for Prompt-Learning.
muzero-general - MuZero
pytorch-learn-reinforcement-learning - A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
chess - Program for playing chess in the console against AI or human opponents
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
NLP-With-PyTorch - My NLP experiments using PyTorch to solve some common NLP problems with advanced and state of the art deep learning techniques.
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
pybullet-gym - Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.