Improving On-policy Learning with Statistical Reward Accumulation.

Yubin Deng, Ke Yu, Dahua Lin, Xiaoou Tang, Chen Change Loy

Published in: CoRR (2018)

Keyphrases

reinforcement learning
learning process
learning algorithm
learning systems
learning problems
neural network
Bayesian networks
reinforcement learning methods
partially observable environments