Combination of learning from non-optimal demonstrations and feedbacks using inverse reinforcement learning and Bayesian policy improvement.

Ali Ezzeddine, Nafee Mourad, Babak Nadjar Araabi, Majid Nili Ahmadabadi
Published in: Expert Syst. Appl. (2018)
Keyphrases