Overfitting-avoiding goal-guided exploration for hard-exploration multi-goal reinforcement learning.

Changlin Han, Zhiyong Peng, Yadong Liu, Jingsheng Tang, Yang Yu, Zongtan Zhou
Published in: Neurocomputing (2023)
Keyphrases
  • guided exploration
  • reinforcement learning
  • optimal policy
  • multi agent
  • cross validation
  • computer assisted