Enforcing KL Regularization in General Tsallis Entropy Reinforcement Learning via Advantage Learning.
Lingwei Zhu, Zheng Chen, Eiji Uchibe, Takamitsu Matsubara
Published in: CoRR (2022)
Keyphrases
- reinforcement learning
- learning algorithm
- learning problems
- learning process
- supervised learning
- special case
- state space
- prior knowledge
- dynamic programming
- mobile robot
- machine learning
- online learning
- learning systems
- knowledge acquisition
- decision trees
- optimal control
- partially observable
- robot control
- temporal difference learning
- autonomous learning
- evolutionary learning