Login / Signup
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback.
Baicen Xiao
Qifan Lu
Bhaskar Ramasubramanian
Andrew Clark
Linda Bushnell
Radha Poovendran
Published in:
AAMAS (2020)
Keyphrases
</>
high dimensional
state space
reward shaping
reinforcement learning algorithms
human subjects
machine learning
reinforcement learning
search space
human experts
objective function
markov chain
complex domains
mixed initiative