Login / Signup
Average-Reward Reinforcement Learning with Trust Region Methods.
Xiaoteng Ma
Xiaohang Tang
Li Xia
Jun Yang
Qianchuan Zhao
Published in:
IJCAI (2021)
Keyphrases
</>
evolutionary algorithm
optimization problems
optimization methods
genetic algorithm
trust region
neural network