Dealing with uncertainty: Balancing exploration and exploitation in deep recurrent reinforcement learning.
Valentina ZangirolamiMatteo BorrottiPublished in: Knowl. Based Syst. (2024)
Keyphrases
- balancing exploration and exploitation
- reinforcement learning
- learning to rank
- function approximation
- state space
- partial observability
- reinforcement learning algorithms
- recurrent neural networks
- optimal control
- feed forward
- machine learning
- uncertain data
- dynamic programming
- model free
- multi agent
- robust optimization
- web search
- supervised learning
- decision theory
- temporal difference
- decision making
- feature selection