Short Quantum Circuits in Reinforcement Learning Policies for the Vehicle Routing Problem.
Fabio Sanches, Sean Weinberg, Takanori Ide, Kazumitsu Kamiya
Published in: CoRR (2021)
Keyphrases
- vehicle routing problem
- reinforcement learning
- optimal policy
- quantum computing
- logic circuits
- metaheuristic
- policy search
- tabu search
- vehicle routing problem with time windows
- markov decision process
- benchmark problems
- routing problem
- test instances
- travel time
- vehicle routing
- combinatorial optimization
- benchmark instances
- hybrid metaheuristic
- np hard
- reward function
- traveling salesman problem
- memetic algorithm
- markov decision processes
- particle swarm optimization
- dynamic programming
- knapsack problem
- multi depot
- optimization problems
- search strategies
- iterated local search
- pick up and delivery
- evolutionary algorithm
- vehicle routing problem with simultaneous
- state space
- variable neighborhood search
- optimal solution
- scheduling problem
- logistics distribution
- learning algorithm
- ant colony optimization
- language model