• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

A Reward Shaping Method based on Meta-LSTM for Continuous Control of Robot.

Jixun YaoXiaoan LiDengshan Huang
Published in: CSAI (2020)
Keyphrases
  • objective function
  • learning algorithm
  • computational complexity
  • dynamic programming
  • neural network
  • probabilistic model
  • state space
  • markov model