Byzantine-Resilient Decentralized Policy Evaluation With Linear Function Approximation.
Zhaoxian WuHan ShenTianyi ChenQing LingPublished in: IEEE Trans. Signal Process. (2021)
Keyphrases
- function approximation
- policy evaluation
- temporal difference
- temporal difference learning algorithms
- reinforcement learning
- model free
- function approximators
- td learning
- radial basis function
- learning tasks
- reinforcement learning algorithms
- least squares
- semi parametric
- linear model
- policy iteration
- multi agent
- data mining
- variance reduction
- monte carlo
- learning algorithm