Optimal policies search for sensor management: Application to the ESA radar.
Thomas Bréhard
Pierre-Arnaud Coquelin
Emmanuel Duflos
Philippe Vanheeghe
Published in:
FUSION (2008)
Keyphrases
optimal policy
markov decision processes
search space
dynamic programming
search algorithm
state space
learning algorithm
reinforcement learning
sufficient conditions
heuristic search
decision problems
infinite horizon
finite horizon
multistage
long run
dynamic programming algorithms