Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning.

Published in: CoRR (2023)

Keyphrases