Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards.
Alexander Trott
Stephan Zheng
Caiming Xiong
Richard Socher
Published in:
NeurIPS (2019)
Keyphrases
reinforcement learning
reward function
bandit problems
markov decision processes
high dimensional
euclidean distance
transfer learning
state space
compressive sensing
machine learning
genetic algorithm
nearest neighbor
distance function
combinatorial optimization
solving problems
sparse data