or-gym
maro
or-gym | maro | |
---|---|---|
2 | 9 | |
355 | 816 | |
- | 2.0% | |
0.0 | 3.5 | |
7 months ago | 2 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
or-gym
-
Gym like frameworks for combinatorial optimization on Graphs?
How about ORGym: https://github.com/hubbs5/or-gym ?
-
Is there a reinforcement learning method to find stock policy for single echelon inventory system ?
Specifically, inputs to I0 through L should be 1-column arrays: https://github.com/hubbs5/or-gym/blob/d5fbc73623c7b197316d33fba094105953889df3/or_gym/envs/supply_chain/inventory_management.py#L46
maro
-
Headstart for multi-container optimization problem.
Yes, I have actually. Some of them which i could find out were: https://github.com/tryton/tryton/tree/main https://pypi.org/project/pyShipping-python3/ https://github.com/microsoft/maro https://github.com/yat-co/yat-trailer-loading https://github.com/duyet/openerp-6.1.1
- maro: NEW Deep Learning And Reinforcement Learning - star count:609.0
-
What's the outlook of Reinforcement Learning?
As far as current SOTA applications, you can just Google it and find plenty of examples of RL being used outside the realm of games. Video/board games offer a nice domain for research in RL, but the underlying algorithms can be (and have been) applied to plenty of domains outside of this. A big one, currently, is robotics. Another example is resource optimization, which is probably currently being developed, if not used, in a lot of technical domains. As u/daddabarba pointed out, RL can also be used in other areas of AI, like text generation.
What are some alternatives?
pytorch-a2c-ppo-acktr-gail - PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
openerp-6.1.1
ml4vrp - Geometric Deep Learning Models for Vehicle Routing Problem
pyEnigma - Python Enigma cypher machine simulator.
NLNS - Neural Large Neighborhood Search for the Capacitated Vehicle Routing Problem
agents-aea - A framework for autonomous economic agent (AEA) development
DeepBeerInventory-RL - The code for the SRDQN algorithm to train an agent for the beer game problem
VMAgent - Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.
OpenGraphGym
ai-economist - Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
palletier - Palletier is a Python implementation of the solution for the distributer's pallet packing problem
blender-quadcopter-fpv - Quadcopter FPV Simulator for blender to capture epic footage