Understanding Reward Ambiguity Through Optimal Transport Theory in Inverse Reinforcement Learning.

Published in: CoRR (2023)

Keyphrases

inverse reinforcement learning
partially observable environments
reward function
Bayesian nonparametric
preference elicitation
dynamic programming
optimal control
multiple agents
optimal solution
machine learning
special case
state space
utility function
decision theory
solving problems
average reward