C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Lambda-Policy Iteration: A Review and a New Implementation.
Dimitri P. Bertsekas
Published in:
CoRR (2015)
Keyphrases
</>
policy iteration
markov decision processes
fixed point
optimal policy
reinforcement learning
model free
search algorithm
sample path
average reward
state space
linear programming
mathematical model
finite state
markov decision process
markov decision problems