Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation.

Uri Sherman Tomer Koren Yishay Mansour

Published in: CoRR (2023)

Keyphrases

function approximation
reinforcement learning
function approximators
temporal difference learning algorithms
online learning
temporal difference
temporal difference learning
learning tasks
reinforcement learning algorithms
radial basis function
model free
markov decision processes
state space
mountain car
temporal difference methods
supervised learning
td learning
transfer learning
machine learning
least squares
pattern recognition
support vector
genetic algorithm