Disentangling Exploration and Exploitation in Deep Reinforcement Learning Using Contingency Awareness.
Ionel HosuTraian RebedeaStefan Trausan-MatuPublished in: ICONIP (5) (2022)
Keyphrases
- exploration exploitation tradeoff
- reinforcement learning
- active exploration
- function approximation
- exploration strategy
- objective function
- relevance feedback
- action selection
- exploration exploitation
- state space
- autonomous learning
- active learning
- optimal policy
- model based reinforcement learning
- multi agent
- machine learning
- model free
- search capabilities
- markov decision processes
- reinforcement learning algorithms
- optimal control
- search strategies
- data sets
- learning process
- information systems
- data mining
- neural network