Login / Signup
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning.
Kang Xu
Yan Ma
Wei Li
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
machine learning
markov decision processes
robust estimation
selection criteria
real time
case study
multi agent
learning process
state space
user preferences