Login / Signup
Proximal Policy Optimization With Policy Feedback.
Yang Gu
Yuhu Cheng
C. L. Philip Chen
Xuesong Wang
Published in:
IEEE Trans. Syst. Man Cybern. Syst. (2022)
Keyphrases
</>
optimal policy
asymptotically optimal
databases
action selection
data sets
mobile robot
relevance feedback
optimization method
constrained optimization
infinite horizon
expected cost
direct search
management policies