Login / Signup
Self-Practice Imitation Learning from Weak Policy.
Qing Da
Yang Yu
Zhi-Hua Zhou
Published in:
PSL (2013)
Keyphrases
</>
imitation learning
reinforcement learning
robotic systems
maximum margin
optimal policy
action selection
markov decision process
real time
video sequences
high dimensional