Circuit Routing Using Monte Carlo Tree Search and Deep Reinforcement Learning.
Youbiao He, Hebi Li, Jin Tian, Forrest Sheng Bao
Published in: VLSI-DAT (2022)
Keyphrases
- monte carlo tree search
- reinforcement learning
- temporal difference
- reinforcement learning methods
- bayesian reinforcement learning
- monte carlo
- temporal difference learning
- tree search algorithm
- evaluation function
- function approximation
- reinforcement learning algorithms
- state space
- model free
- learning algorithm
- optimal policy
- optimal control
- control problems
- game tree
- policy iteration
- shortest path
- supervised learning
- learning process
- monte carlo search
- action selection
- step size
- dynamic programming
- neural network
- alpha beta search