Login / Signup
Path Consistency Learning in Tsallis Entropy Regularized MDPs.
Yinlam Chow
Ofir Nachum
Mohammad Ghavamzadeh
Published in:
ICML (2018)
Keyphrases
</>
reinforcement learning
path consistency
learning algorithm
least squares
constraint satisfaction problems
learning process
markov decision processes
knowledge base
computational complexity
image retrieval
database design