Learning Stochastic Shortest Path with Linear Function Approximation.

Yifei Min Jiafan He Tianhao Wang Quanquan Gu

Published in: CoRR (2021)

Keyphrases

function approximation
reinforcement learning
learning tasks
function approximators
temporal difference learning algorithms
td learning
learning process
active learning
model free
learning algorithm
temporal difference
connectionist networks
machine learning
finite state
radial basis function
monte carlo
supervised learning
multi agent
temporal difference methods