Login / Signup
Hierarchical Decision and Control for Continuous Multitarget Problem: Policy Evaluation With Action Delay.
Jiangcheng Zhu
Jun Zhu
Zhepei Wang
Shan Guo
Chao Xu
Published in:
IEEE Trans. Neural Networks Learn. Syst. (2019)
Keyphrases
</>
policy evaluation
least squares
monte carlo
decision making
action selection
temporal difference
reinforcement learning
control system
learning algorithm
function approximation
decision process
policy iteration
decision makers
sufficient conditions
model selection
decision problems
model free