Login / Signup

Distributed Policy Evaluation Under Multiple Behavior Strategies.

Sergio Valcarcel MacuaJianshu ChenSantiago ZazoAli H. Sayed
Published in: IEEE Trans. Autom. Control. (2015)
Keyphrases
  • policy evaluation
  • least squares
  • multi agent
  • learning algorithm
  • training data
  • temporal difference
  • model selection
  • markov decision processes
  • model free