Learning Stochastic Shortest Path with Linear Function Approximation.
Yifei Min, Jiafan He, Tianhao Wang, Quanquan Gu
Published in: CoRR (2021)
Keyphrases
- function approximation
- reinforcement learning
- learning tasks
- function approximators
- temporal difference learning algorithms
- td learning
- learning process
- active learning
- model free
- learning algorithm
- temporal difference
- connectionist networks
- machine learning
- finite state
- radial basis function
- monte carlo
- supervised learning
- multi agent
- temporal difference methods