Login / Signup
Smooth and Stopping Interval Aware Driving Behavior Prediction at Un-signalized Intersection with Inverse Reinforcement Learning on Sequential MDPs.
Shaoyu Yang
Hiroshi Yoshitake
Motoki Shino
Masamichi Shimosaka
Published in:
IV (2021)
Keyphrases
</>
inverse reinforcement learning
reward function
markov decision processes
preference elicitation
state space
driving behavior
reinforcement learning
finite horizon
markov decision process
optimal policy
temporal difference