Inverse Q-Learning Using Input-Output Data.
Bosen LianWenqian XueFrank L. LewisAli DavoudiPublished in: IEEE Trans. Cybern. (2024)
Keyphrases
- reinforcement learning
- multi agent
- cooperative
- function approximation
- learning algorithm
- state space
- reinforcement learning algorithms
- stochastic approximation
- model free
- dynamic programming
- hierarchical reinforcement learning
- action selection
- orthogonal matrices
- databases
- td learning
- learning rate
- evaluation function
- markov chain
- social networks
- artificial intelligence