Protecting Reward Function of Reinforcement Learning via Minimal and Non-catastrophic Adversarial Trajectory.

Published in: SRDS (2021)

Keyphrases