Model-Based Offline Meta-Reinforcement Learning with Regularization.
Sen LinJialin WanTengyu XuYingbin LiangJunshan ZhangPublished in: CoRR (2022)
Keyphrases
- reinforcement learning
- model free
- function approximation
- markov decision processes
- learning algorithm
- reinforcement learning algorithms
- multi agent reinforcement learning
- state space
- optimal policy
- temporal difference
- multi agent
- real time
- actor critic
- autonomous learning
- temporal difference learning
- learning problems
- mobile robot
- clustering algorithm
- machine learning