Login / Signup
Expressing Arbitrary Reward Functions as Potential-Based Advice.
Anna Harutyunyan
Sam Devlin
Peter Vrancx
Ann Nowé
Published in:
AAAI (2015)
Keyphrases
</>
reward function
markov decision processes
optimal policy
transition probabilities
multiple agents
simple examples
dynamic systems
state variables