Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning.
Zheng WuYichen XieWenzhao LianChanghao WangYanjiang GuoJianyu ChenStefan SchaalMasayoshi TomizukaPublished in: ICRA (2023)
Keyphrases
- reinforcement learning
- optimal policy
- transfer learning
- approximate dynamic programming
- policy search
- markov decision process
- machine learning
- learning algorithm
- partially observable environments
- policy gradient
- reinforcement learning algorithms
- action selection
- state space
- model free
- finite state
- knowledge transfer
- policy iteration
- action space
- control policy
- markov decision processes
- policy evaluation
- multi agent
- multiscale