Generative Adversarial Inverse Reinforcement Learning With Deep Deterministic Policy Gradient.
Ming ZhanJingjing FanJianying GuoPublished in: IEEE Access (2023)
Keyphrases
- inverse reinforcement learning
- policy gradient
- reward function
- reinforcement learning algorithms
- reinforcement learning
- preference elicitation
- function approximation
- gradient method
- temporal difference
- generative model
- optimal control
- state action
- multi agent
- single agent
- machine learning
- markov decision processes
- long run
- model selection
- average reward