Dynamics-Aware Comparison of Learned Reward Functions.
Blake Wulfe
Ashwin Balakrishna
Logan Ellis
Jean Mercat
Rowan McAllister
Adrien Gaidon
Published in:
CoRR (2022)
Keyphrases
reward function
reinforcement learning
dynamical systems
state space
Markov decision processes
learning algorithm
optimal policy
inverse reinforcement learning
machine learning
objective function
probabilistic model
multiple agents