Learning behaviors via human-delivered discrete feedback: modeling implicit feedback strategies to speed up learning.
Robert T. Loftin, Bei Peng, James MacGlashan, Michael L. Littman, Matthew E. Taylor, Jeff Huang, David L. Roberts
Published in: Auton. Agents Multi Agent Syst. (2016)