Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning.

Aritra Mitra George J. Pappas Hamed Hassani

Published in: CoRR (2023)

Keyphrases

temporal difference learning
reinforcement learning
function approximation
evaluation function
reinforcement learning algorithms
temporal difference
fixed point
game playing
approximate value iteration
markov decision process
function approximators
policy iteration
model free
machine learning
learning algorithm
optimal policy
state space
action selection
learning process
multi agent