Improving Generalization of Deep Reinforcement Learning-based TSP Solvers.
Wenbin OuyangYisen WangShaochen HanZhejian JinPaul WengPublished in: SSCI (2021)
Keyphrases
- reinforcement learning
- traveling salesman problem
- state space
- function approximation
- genetic algorithm
- learning algorithm
- machine learning
- optimal solution
- optimal policy
- temporal difference learning
- reinforcement learning algorithms
- combinatorial optimization
- policy search
- markov decision process
- action selection
- optimal control
- ant colony optimization
- np hard
- active learning
- multi agent