Login / Signup
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay.
Xingxing Liang
Yang Ma
Yanghe Feng
Zhong Liu
Published in:
CoRR (2021)
Keyphrases
</>
optimization problems
global optimization
optimization algorithm
optimization method
discrete optimization
decision making
low level
dynamic programming
online learning
trajectory data