Distributed Policy Evaluation Under Multiple Behavior Strategies.
Sergio Valcarcel Macua
Jianshu Chen
Santiago Zazo
Ali H. Sayed
Published in:
IEEE Trans. Autom. Control. (2015)
Keyphrases
policy evaluation
least squares
multi-agent
learning algorithm
training data
temporal difference
model selection
Markov decision processes
model-free