Login / Signup
Multi-objective reinforcement learning method for acquiring all pareto optimal policies simultaneously.
Yusuke Mukai
Yasuaki Kuroe
Hitoshi Iima
Published in:
SMC (2012)
Keyphrases
</>
multi objective
reinforcement learning
optimal policy
dynamic programming
optimization algorithm
machine learning
objective function
evolutionary algorithm
model free
markov decision processes
data mining
computational complexity
sufficient conditions
long run