Inverse Q-Learning Using Input-Output Data.

Bosen Lian, Wenqian Xue, Frank L. Lewis, Ali Davoudi

Published in: IEEE Trans. Cybern. (2024)

Keyphrases

reinforcement learning
multi agent
cooperative
function approximation
learning algorithm
state space
reinforcement learning algorithms
stochastic approximation
model free
dynamic programming
hierarchical reinforcement learning
action selection
orthogonal matrices
databases
TD learning
learning rate
evaluation function
Markov chain
social networks
artificial intelligence