
Average-Reward Reinforcement Learning with Trust Region Methods.

Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang, Qianchuan Zhao
Published in: IJCAI (2021)
Keyphrases
  • evolutionary algorithm
  • optimization problems
  • optimization methods
  • genetic algorithm
  • trust region
  • neural network