Login / Signup
STARC: A General Framework For Quantifying Differences Between Reward Functions.
Joar Skalse
Lucy Farnik
Sumeet Ramesh Motwani
Erik Jenner
Adam Gleave
Alessandro Abate
Published in:
CoRR (2023)
Keyphrases
</>
reward function
markov decision processes
reinforcement learning
inverse reinforcement learning
policy search
multiple agents
transition probabilities
statistically significant
state variables
markov decision process
state space
optimal policy
reinforcement learning algorithms
markov decision problems
simple examples
dynamical systems
random walk
linear programming
data mining