Corruption-Robust Offline Reinforcement Learning with General Function Approximation.

Chenlu Ye, Rui Yang, Quanquan Gu, Tong Zhang

Published in: CoRR (2023)

Keyphrases

function approximation
reinforcement learning
temporal difference
temporal difference learning algorithms
model free
temporal difference learning
mountain car
tile coding
reinforcement learning algorithms
state action space
learning tasks
radial basis function
function approximators
genetic algorithm
state space
neural network
action selection
step size
continuous state
policy evaluation
artificial neural networks
learning algorithm