Byzantine-Resilient Decentralized Policy Evaluation With Linear Function Approximation.

Zhaoxian Wu Han Shen Tianyi Chen Qing Ling

Published in: IEEE Trans. Signal Process. (2021)

Keyphrases

function approximation
policy evaluation
temporal difference
temporal difference learning algorithms
reinforcement learning
model free
function approximators
td learning
radial basis function
learning tasks
reinforcement learning algorithms
least squares
semi parametric
linear model
policy iteration
multi agent
data mining
variance reduction
monte carlo
learning algorithm