Login / Signup
Multicar Elevator Group Control: Average Reward Learning Method for Service Completion Time Reduction and Interference Prevention.
Alex Valdivielso
Toshiyuki Miyamoto
Published in:
CCA (2010)
Keyphrases
</>
learning algorithm
objective function
dynamic programming
supervised learning
reinforcement learning
least squares
support vector machine svm
optimal policy
average reward