Boosting Policy Learning in Reinforcement Learning via Adaptive Intrinsic Reward Regulation.
Qian ZhaoJinhui HanMao XuPublished in: IEEE Access (2024)
Keyphrases
- reinforcement learning
- partially observable environments
- learning algorithm
- actor critic
- learning process
- policy gradient
- eligibility traces
- inverse reinforcement learning
- learning problems
- action selection
- function approximation
- machine learning
- learning systems
- adaptive control
- rl algorithms
- learning capabilities
- policy search
- temporal difference learning
- partially observable
- markov decision processes
- online learning
- reinforcement learning algorithms
- reward function
- adaptive learning
- learning agent
- control policy
- average reward
- state action
- optimal policy
- agent learns
- mobile robot
- active learning