Addressing Hindsight Bias in Multigoal Reinforcement Learning.
Chenjia BaiLingxiao WangYixin WangZhaoran WangRui ZhaoChenyao BaiPeng LiuPublished in: IEEE Trans. Cybern. (2023)
Keyphrases
- reinforcement learning
- action selection
- function approximation
- reinforcement learning algorithms
- temporal difference
- model free
- state space
- markov decision processes
- learning algorithm
- multi agent reinforcement learning
- temporal difference learning
- optimal control
- multi agent
- information systems
- neural network
- transfer learning
- optimal policy
- hidden markov models
- active learning
- learning classifier systems
- trade off
- objective function
- case study
- learning capabilities
- control problems
- autonomous learning
- transition model