Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification.

Published in: NeurIPS (2022)

Keyphrases