Balancing Policy Improvement and Evaluation in Risk-Sensitive Satisficing Algorithm.
Hiroaki Wakabayashi
Takumi Kamiya
Tatsuji Takahashi
Published in:
JSAI (2020)
Keyphrases
learning algorithm
objective function
optimal solution
computational complexity
dynamic programming
foraging theory
Monte Carlo
NP-hard
linear programming
convergence rate
optimality criterion