On Reward-Free Reinforcement Learning with Linear Function Approximation.

Ruosong Wang Simon S. Du Lin F. Yang Russ R. Salakhutdinov

Published in: NeurIPS (2020)

Keyphrases

function approximation
reinforcement learning
temporal difference learning algorithms
function approximators
temporal difference
model free
reinforcement learning algorithms
temporal difference learning
state space
machine learning
mountain car
continuous state
reinforcement learning methods
learning algorithm
markov decision processes
supervised learning
transfer learning
artificial neural networks
policy search
markov decision problems
optimal policy
action space
learning process
dynamic programming
optimal control
policy gradient
radial basis function
temporal difference methods
neural network