Sign in

Potential-based reward shaping for finite horizon online POMDP planning.

Adam EckLeen-Kiat SohSam DevlinDaniel Kudenko
Published in: Auton. Agents Multi Agent Syst. (2016)
Keyphrases