Login / Signup

Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization.

Shengxiang LiOu LiGuangyi LiuSiyuan DingYijie Bai
Published in: IEEE Access (2021)
Keyphrases
  • neural network
  • real time
  • spatio temporal
  • case study
  • search algorithm
  • optimal policy
  • global optimization
  • optimization model
  • trajectory data