Improved model-free H∞ control for batch processes via off-policy 2D game Q-learning.
Xueying JiangMin HuangHanbin KuangHuiyuan ShiXingwei WangLoo Hay LeePublished in: Int. J. Control (2023)
Keyphrases
- model free
- reinforcement learning
- impedance control
- reinforcement learning algorithms
- function approximation
- policy iteration
- temporal difference
- control system
- game theory
- stochastic games
- average reward
- temporal difference learning
- optimal control
- state space
- policy evaluation
- evaluation function
- nash equilibrium
- game playing