Login / Signup
Learning Policies for Markov Decision Processes From Data.
Manjesh Kumar Hanawal
Hao Liu
Henghui Zhu
Ioannis Ch. Paschalidis
Published in:
IEEE Trans. Autom. Control. (2019)
Keyphrases
</>
markov decision processes
reinforcement learning
learning algorithm
optimal policy
probability distribution
learning tasks
partially observable
average cost
multi agent
finite state
markov decision process
decision processes
model based reinforcement learning