Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning.
Xiaoyun Feng, Li Jiang, Xudong Yu, Haoran Xu, Xiaoyan Sun, Jie Wang, Xianyuan Zhan, Wai Kin Chan
Published in: IEEE Trans. Games (2024)
Keyphrases
- reinforcement learning
- function approximation
- imitation learning
- reinforcement learning methods
- model free
- reinforcement learning algorithms
- learning algorithm
- multi agent
- state space
- learning classifier systems
- action selection
- learning goals
- transition model
- real time
- temporal difference learning
- markov decision process
- learning problems
- machine learning
- neural network