Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning.

Kang Xu, Yan Ma, Wei Li
Published in: CoRR (2022)
Keyphrases
  • reinforcement learning
  • machine learning
  • Markov decision processes
  • robust estimation
  • selection criteria
  • real time
  • case study
  • multi agent
  • learning process
  • state space
  • user preferences