Exploration by Distributional Reinforcement Learning.
Yunhao Tang, Shipra Agrawal
Published in: CoRR (2018)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- action selection
- exploration-exploitation
- model-based reinforcement learning
- function approximation
- autonomous learning
- state space
- learning algorithm
- Markov decision processes
- optimal control
- temporal difference learning
- co-occurrence
- machine learning
- multi-agent
- active learning
- learning capabilities
- learning process
- function approximators
- temporal difference
- reinforcement learning algorithms
- optimal policy
- Markov decision process
- sufficient conditions
- model-free
- dynamic programming
- learning classifier systems
- neural network
- dynamic environments
- robotic control
- learning problems