Selective imitation on the basis of reward function similarity.
Max Taylor-Davies
Stephanie Droop
Chris Lucas
Published in:
CogSci (2023)
Keyphrases
reward function
reinforcement learning
inverse reinforcement learning
similarity measure
state space
Markov decision processes
reinforcement learning algorithms
Markov decision process
hierarchical reinforcement learning
data mining
learning algorithm
higher order
function approximation
transition model