Deep Reinforcement Learning for the Capacitated Vehicle Routing Problem with Soft Time Window.
Xiaohe Wang, Xinli Shi
Published in: WCSP (2022)
Keyphrases
- reinforcement learning
- function approximation
- model free
- state space
- metaheuristic
- reinforcement learning algorithms
- ant colony optimization
- supervised learning
- optimal policy
- markov decision processes
- learning problems
- memetic algorithm
- vehicle routing problem
- multi agent
- optimal control
- temporal difference
- robotic control
- neural network
- action selection
- linear programming
- e learning