Learning Long-Term Reward Redistribution via Randomized Return Decomposition.
Zhizhou Ren
Ruihan Guo
Yuan Zhou
Jian Peng
Published in:
ICLR (2022)
Keyphrases
long term
reinforcement learning
learning process
learning algorithm
background knowledge
active learning
supervised learning
longer term
real time
short term
learning systems
empirical studies
online learning
hidden markov models
artificial neural networks
multi agent systems
neural network