Adaptive Policy Learning for Offline-to-Online Reinforcement Learning.
Han ZhengXufang LuoPengfei WeiXuan SongDongsheng LiJing JiangPublished in: AAAI (2023)
Keyphrases
- reinforcement learning
- online learning
- learning algorithm
- learning process
- learning capabilities
- supervised learning
- actor critic
- real time
- learning systems
- policy search
- reinforcement learning algorithms
- active learning
- optimal policy
- learning tasks
- adaptive learning
- action selection
- eligibility traces
- autonomous learning
- partially observable environments
- function approximators
- prior knowledge
- neural network
- learning agents
- adaptive control
- evolutionary learning
- multi agent reinforcement learning
- learning problems
- reinforcement learning problems
- e learning