Login / Signup
Your Policy Regularizer is Secretly an Adversary.
Rob Brekelmans
Tim Genewein
Jordi Grau-Moya
Gregoire Detetang
Markus Kunesch
Shane Legg
Pedro A. Ortega
Published in:
Trans. Mach. Learn. Res. (2022)
Keyphrases
</>
optimal policy
policy making
semi supervised
database
genetic algorithm
management system
inverse reinforcement learning
neural network
total variation
multi task learning
management policies
policy search