Offline Learning in Markov Games with General Function Approximation.

Yuheng Zhang, Yu Bai, Nan Jiang

Published in: CoRR (2023)

Keyphrases

function approximation
reinforcement learning
learning tasks
learning algorithm
learning process
TD learning
temporal difference methods
active learning
temporal difference
function approximators
radial basis function
reinforcement learning algorithms
neural network
artificial neural networks
supervised learning
model free