Overcoming Exploration in Reinforcement Learning with Demonstrations.

Ashvin Nair Bob McGrew Marcin Andrychowicz Wojciech Zaremba Pieter Abbeel

Published in: CoRR (2017)

Keyphrases

reinforcement learning
active exploration
exploration strategy
exploration exploitation
action selection
function approximation
model based reinforcement learning
exploration exploitation tradeoff
model free
state space
reinforcement learning algorithms
multi agent
autonomous learning
transfer learning
balancing exploration and exploitation
learning algorithm
dynamic programming
temporal difference learning
optimal policy
robot control
information visualization
transition model
policy search
learning classifier systems
relevance feedback
search engine
learning tasks
machine learning