Login / Signup
Optimization Issues in KL-Constrained Approximate Policy Iteration.
Nevena Lazic
Botao Hao
Yasin Abbasi-Yadkori
Dale Schuurmans
Csaba Szepesvári
Published in:
CoRR (2021)
Keyphrases
</>
evolutionary algorithm
dynamic environments