Prior-dependent analysis of posterior sampling reinforcement learning with function approximation.

Yingru Li, Zhi-Quan Luo

Published in: CoRR (2024)

Keyphrases

function approximation
reinforcement learning
temporal difference learning
radial basis function
temporal difference
state space
probability distribution
function approximators
temporal difference learning algorithms
learning tasks
neural network
artificial neural networks
model free
mountain car
reinforcement learning algorithms
multi agent
prior knowledge
small number