DeepTH: Chip Placement with Deep Reinforcement Learning Using a Three-Head Policy Network.
Dengwei Zhao, Shuai Yuan, Yanan Sun, Shikui Tu, Lei Xu
Published in: DATE (2023)
Keyphrases
- reinforcement learning
- optimal policy
- policy search
- high speed
- peer to peer
- computer networks
- state space
- partially observable environments
- action selection
- network structure
- complex networks
- markov decision processes
- high density
- low cost
- markov decision problems
- dynamic programming
- real time
- network traffic
- network model
- model free
- markov decision process
- control policies