Self-adaptive Inverse Soft-Q Learning for Imitation.
Zhuo WangQuan LiuXiongzhen ZhangPublished in: ICONIP (9) (2023)
Keyphrases
- reinforcement learning
- cooperative
- function approximation
- state space
- multi agent
- optimal policy
- learning algorithm
- stochastic approximation
- imitation learning
- continuous state and action spaces
- temporal difference learning
- model free
- control parameters
- data sets
- least squares
- decision making
- multi agent reinforcement learning
- real time