Posterior Sampling for Competitive RL: Function Approximation and Partial Observation.

Shuang Qiu Ziyu Dai Han Zhong Zhaoran Wang Zhuoran Yang Tong Zhang

Published in: NeurIPS (2023)

Keyphrases

function approximation
reinforcement learning
tile coding
model free
temporal difference
temporal difference learning algorithms
temporal difference learning
learning tasks
reinforcement learning algorithms
radial basis function
function approximators
probability distribution
state space
temporal difference methods
td learning
machine learning
real valued
feature extraction
decision trees
reinforcement learning problems
genetic algorithm