Reinforcement Learning for Solving Stochastic Vehicle Routing Problem with Time Windows.
Zangir IklassovIkboljon SobirovRuben SolozabalMartin TakácPublished in: CoRR (2024)
Keyphrases
- vehicle routing problem with time windows
- reinforcement learning
- waste collection
- multi depot
- set partitioning
- column generation
- vehicle routing problem
- exact algorithms
- stochastic approximation
- benchmark problems
- routing problem
- integer programming
- state space
- markov decision processes
- exact solution
- neural network
- integer program
- genetic programming
- dynamic programming
- bayesian networks