Dyna-Validator: A Model-based Reinforcement Learning Method with Validated Simulated Experiences.
Hengsheng Zhang
Jingchen Li
Ziming He
Jinhui Zhu
Haobin Shi
Published in:
Int. J. Comput. Commun. Control (2023)
Keyphrases
objective function
pairwise
active learning
dynamic programming