Overcoming Exploration in Reinforcement Learning with Demonstrations.
Ashvin Nair, Bob McGrew, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel
Published in: CoRR (2017)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- exploration exploitation
- action selection
- function approximation
- model based reinforcement learning
- exploration exploitation tradeoff
- model free
- state space
- reinforcement learning algorithms
- multi agent
- autonomous learning
- transfer learning
- balancing exploration and exploitation
- learning algorithm
- dynamic programming
- temporal difference learning
- optimal policy
- robot control
- information visualization
- transition model
- policy search
- learning classifier systems
- relevance feedback
- search engine
- learning tasks
- machine learning