Login / Signup

A Reward Shaping Method based on Meta-LSTM for Continuous Control of Robot.

Jixun YaoXiaoan LiDengshan Huang
Published in: CSAI (2020)
Keyphrases
  • objective function
  • learning algorithm
  • computational complexity
  • dynamic programming
  • neural network
  • probabilistic model
  • state space
  • markov model