Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble.
Chao Li
Fengge Wu
Junsuo Zhao
Published in:
IJCNN (2023)
Keyphrases
imitation learning
optimal policy
maximum margin
training set
learning algorithm
reinforcement learning
real time
machine learning
computer vision
dynamic programming
support vector machine
multi modal
robotic systems