Login / Signup
Robust reinforcement learning under minimax regret for green security.
Lily Xu
Andrew Perrault
Fei Fang
Haipeng Chen
Milind Tambe
Published in:
UAI (2021)
Keyphrases
</>
reinforcement learning
minimax regret
information security
learning process
machine learning
preference elicitation
optimal policy
utility function