Gap-Dependent Unsupervised Exploration for Reinforcement Learning.
Jingfeng WuVladimir BravermanLin F. YangPublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- action selection
- supervised learning
- model based reinforcement learning
- unsupervised learning
- function approximation
- completely unsupervised
- state space
- exploration exploitation
- markov decision processes
- autonomous learning
- semi supervised
- temporal difference
- optimal control
- supervised classification
- machine learning
- model free
- reinforcement learning algorithms
- data driven
- multi agent
- robotic control
- neural network
- learning problems
- transfer learning
- weakly supervised
- robot control
- unsupervised manner
- optimal policy
- multi agent reinforcement learning
- learning process
- information retrieval