Login / Signup

Multicar Elevator Group Control: Average Reward Learning Method for Service Completion Time Reduction and Interference Prevention.

Alex ValdivielsoToshiyuki Miyamoto
Published in: CCA (2010)
Keyphrases
  • learning algorithm
  • objective function
  • dynamic programming
  • supervised learning
  • reinforcement learning
  • least squares
  • support vector machine svm
  • optimal policy
  • average reward