ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay.
Sameera Lanka
Tianfu Wu
Published in:
CoRR (2018)
Keyphrases
reinforcement learning
data mining
multiresolution
data sets
genetic algorithm
search engine
three dimensional
domain knowledge
mobile robot
markov decision processes
learning curve
multiarmed bandit