Login / Signup
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning.
Sirui Xie
Junning Huang
Lanxin Lei
Chunxiao Liu
Zheng Ma
Wei Zhang
Liang Lin
Published in:
CoRR (2018)
Keyphrases
</>
dynamic programming
reinforcement learning
segmentation method
learning algorithm
similarity measure
pairwise
d objects
energy function
kalman filter