Self-adaptive Inverse Soft-Q Learning for Imitation.

Zhuo Wang, Quan Liu, Xiongzhen Zhang

Published in: ICONIP (9) (2023)

Keyphrases

reinforcement learning
cooperative
function approximation
state space
multi agent
optimal policy
learning algorithm
stochastic approximation
imitation learning
continuous state and action spaces
temporal difference learning
model free
control parameters
data sets
least squares
decision making
multi agent reinforcement learning
real time