encode-attend-navigate-pytorch
Encode-attend-navigate unofficial Pytorch implementation (by astariul)
stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code (by Stable-Baselines-Team)
encode-attend-navigate-pytorch | stable-baselines3-contrib | |
---|---|---|
4 | 6 | |
11 | 635 | |
- | 1.4% | |
2.5 | 5.9 | |
11 months ago | 22 days ago | |
Python | Python | |
- | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
encode-attend-navigate-pytorch
Posts with mentions or reviews of encode-attend-navigate-pytorch.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-04-25.
-
[Pytorch reimplementation] Encode-Attend-Navigate, a RL-based TSP solver;
https://github.com/astariul/encode-attend-navigate-pytorch I recently re-implemented encode-attend-navigate, a TSP solver based on RL. The official repo was using tensorflow 1.x, so I decided tore-implement it at Pytorch. I wanted to share it here to get some opinion :) You can train the model using a free GPU from Google Colab, a Colab notebook is provided in the README !
- [P] Pytorch reimplementation of Encode-Attend-Navigate, a RL-based TSP solver
-
[P] Reinforcement Learning with multiple simultaneous actions?
Pytorch : encode-attend-navigate-pytorch
- [Pytorch reimplementation] Encode-Attend-Navigate, a RL-based TSP solver
stable-baselines3-contrib
Posts with mentions or reviews of stable-baselines3-contrib.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-12-09.
-
Problem with Truncated Quantile Critics (TQC) and n-step learning algorithm.
# https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/sb3_contrib/tqc/tqc.py :
-
Understanding Action Masking in RLlib
Here's a theoretical overview and an implementation of action masking for PPO.
-
PPO rollout buffer for turn-based two-player game with varying turn lengths
Simplified version of rollout collection (adapted from ppo_mask.py line 282):
-
GitHub Copilot: your AI pair programmer
Transformers (GPT-3) aren't quite _supervised_, but it does require valid samples.
Agree 100% with RL being the path forward. You probably have already seen ( https://venturebeat.com/2021/06/09/deepmind-says-reinforceme... ). Personally I'm really stoked for this https://github.com/Stable-Baselines-Team/stable-baselines3-c... , which will make it a lot easier for rubes like me to use RL.
-
[P] Stable-Baselines3 v1.0 - Reliable implementations of RL algorithms
But as we already have vanilla DQN and QR-DQN (in our contrib repo: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib ) I think it is already a good start for off-policy discrete action algorithms. (QR-DQN is usually competitive vs DQN+extensions)
What are some alternatives?
When comparing encode-attend-navigate-pytorch and stable-baselines3-contrib you can also consider the following projects:
som-tsp - Solving the Traveling Salesman Problem using Self-Organizing Maps
muzero-general - MuZero
HybridTSPSolver - A hybrid TSP solver that I made for my master's degree thesis in computer science.
dreamerv2 - Mastering Atari with Discrete World Models
VeRyPy - A python library with implementations of 15 classical heuristics for the capacitated vehicle routing problem.
TabNine - AI Code Completions