Learning Long-Term Reward Redistribution via Randomized Return Decomposition.

Zhizhou Ren Ruihan Guo Yuan Zhou Jian Peng

Published in: ICLR (2022)

Keyphrases

long term
reinforcement learning
learning process
learning algorithm
background knowledge
active learning
supervised learning
longer term
real time
short term
learning systems
empirical studies
online learning
hidden markov models
artificial neural networks
multi agent systems
neural network