Reinforcement Learning from Bagged Reward: A Transformer-based Approach for Instance-Level Reward Redistribution.
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
Published in:
CoRR (2024)
Keyphrases
reinforcement learning
instance level
state space
learning algorithm
reward function
supervised learning
optimal policy
multiple instance
pairwise
learning process
transfer learning
random forests