Login / Signup
Maximum Expected Hitting Cost of a Markov Decision Process and Informativeness of Rewards.
Falcon Z. Dai
Matthew R. Walter
Published in:
CoRR (2019)
Keyphrases
</>
social networks
reward function
optimal policy
reinforcement learning
high cost
total cost
markov decision processes
optimal solution
hidden markov models
opportunity cost
data mining
database systems
long run average cost