Login / Signup
Inverse Reinforcement Learning with Unknown Reward Model based on Structural Risk Minimization.
Chendi Qu
Jianping He
Xiaoming Duan
Jiming Chen
Published in:
CoRR (2023)
Keyphrases
</>
inverse reinforcement learning
structural risk minimization
partially observable environments
reward function
empirical risk minimization
support vector
preference elicitation
state space
support vector regression
temporal difference
reinforcement learning
model free
support vector machine
utility function