Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency.

Qi Cai Zhuoran Yang Zhaoran Wang

Published in: ICML (2022)

Keyphrases

function approximation
reinforcement learning
temporal difference learning algorithms
function approximators
temporal difference learning
mountain car
temporal difference
radial basis function
state action space
learning tasks
tile coding
model free
td learning
reinforcement learning algorithms
state space
dynamic programming
neural network
small number
multi agent
machine learning
real valued
optimal control
action selection
support vector machine
policy evaluation
reinforcement learning problems
temporal difference methods
learning algorithm