Login / Signup
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator.
Ryoma Furuyama
Daiki Kuyoshi
Satoshi Yamane
Published in:
CoRR (2024)
Keyphrases
</>
imitation learning
reinforcement learning
robotic systems
humanoid robot
maximum margin
state space
graphical models
function approximation