Deep Reinforcement Learning for the Capacitated Vehicle Routing Problem with Soft Time Window.

Xiaohe Wang Xinli Shi

Published in: WCSP (2022)

Keyphrases

reinforcement learning
function approximation
model free
state space
metaheuristic
reinforcement learning algorithms
ant colony optimization
supervised learning
optimal policy
markov decision processes
learning problems
memetic algorithm
vehicle routing problem
multi agent
optimal control
temporal difference
robotic control
neural network
action selection
linear programming
e learning