On Reward-Free Reinforcement Learning with Linear Function Approximation.

Ruosong Wang Simon S. Du Lin F. Yang Ruslan Salakhutdinov

Published in: CoRR (2020)

Keyphrases

function approximation
reinforcement learning
temporal difference learning algorithms
function approximators
temporal difference
temporal difference learning
state space
reinforcement learning algorithms
model free
mountain car
markov decision processes
learning process
radial basis function
policy gradient
multi agent
machine learning
transfer learning
optimal policy
reward function
supervised learning
dynamic programming
learning algorithm
learning tasks
learning agent
average reward
state action
markov decision problems
actor critic
neural network