Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification.
Takumi Tanabe
Rei Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
Published in:
NeurIPS (2022)
Keyphrases
max min
worst case
objective function
computational complexity
cost function
hill climbing
learning algorithm
decision making
dynamic programming
support vector machine
least squares