Empirical Q-Value Iteration.
Dileep M. Kalathil
Vivek S. Borkar
Rahul Jain
Published in:
CoRR (2014)
Keyphrases
markov decision processes
state space
theoretical analysis
infinite horizon
machine learning
heuristic search
databases
markov decision process
optimal policy
dynamic programming
neural network
markov chain
information systems
information theoretic
artificial intelligence
empirical data
data sets