Offline Learning in Markov Games with General Function Approximation.
Yuheng Zhang, Yu Bai, Nan Jiang
Published in: CoRR (2023)
Keyphrases
- function approximation
- reinforcement learning
- learning tasks
- learning algorithm
- learning process
- TD learning
- temporal difference methods
- active learning
- temporal difference
- function approximators
- radial basis function
- reinforcement learning algorithms
- neural network
- artificial neural networks
- supervised learning
- model free