Login / Signup
Simultaneous Translation with Flexible Policy via Restricted Imitation Learning.
Baigong Zheng
Renjie Zheng
Mingbo Ma
Liang Huang
Published in:
CoRR (2019)
Keyphrases
</>
imitation learning
reinforcement learning
robotic systems
maximum margin
humanoid robot
action selection
data points
multi modal
optimal policy
relational domains