Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients.

Published in: CoRR (2024)

Keyphrases