Login / Signup
Emphatic TD Bellman Operator is a Contraction.
Assaf Hallak
Aviv Tamar
Shie Mannor
Published in:
CoRR (2015)
Keyphrases
</>
temporal difference learning
linear program
learning algorithm
piecewise linear
belief change
temporal difference
reinforcement learning
belief revision
databases
shape analysis
function approximation
reinforcement learning algorithms
function approximators
eligibility traces