Empirical Q-Value Iteration.

Dileep M. Kalathil Vivek S. Borkar Rahul Jain

Published in: CoRR (2014)

Keyphrases

markov decision processes
state space
theoretical analysis
infinite horizon
machine learning
heuristic search
databases
markov decision process
optimal policy
dynamic programming
neural network
markov chain
information systems
information theoretic
artificial intelligence
empirical data
data sets