DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning.

Published in: NeurIPS (2022)

Keyphrases