Addressing Hindsight Bias in Multigoal Reinforcement Learning.

Chenjia Bai, Lingxiao Wang, Yixin Wang, Zhaoran Wang, Rui Zhao, Chenyao Bai, Peng Liu

Published in: IEEE Trans. Cybern. (2023)

Keyphrases

reinforcement learning
action selection
function approximation
reinforcement learning algorithms
temporal difference
model free
state space
Markov decision processes
learning algorithm
multi agent reinforcement learning
temporal difference learning
optimal control
multi agent
information systems
neural network
transfer learning
optimal policy
hidden Markov models
active learning
learning classifier systems
trade off
objective function
case study
learning capabilities
control problems
autonomous learning
transition model