A novel multi-step reinforcement learning method for solving reward hacking.

Published in: Appl. Intell. (2019)

Keyphrases