Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization.
Shengxiang Li
Ou Li
Guangyi Liu
Siyuan Ding
Yijie Bai
Published in:
IEEE Access (2021)
Keyphrases
neural network
real time
spatio temporal
case study
search algorithm
optimal policy
global optimization
optimization model
trajectory data