Path Consistency Learning in Tsallis Entropy Regularized MDPs.
Ofir Nachum
Yinlam Chow
Mohammad Ghavamzadeh
Published in:
CoRR (2018)
Keyphrases
reinforcement learning
learning algorithm
special case
knowledge base
functional dependencies
temporal reasoning
path consistency
consistency checking