Login / Signup
PORF-DDPG: Learning Personalized Autonomous Driving Behavior with Progressively Optimized Reward Function.
Jie Chen
Tao Wu
Meiping Shi
Wei Jiang
Published in:
Sensors (2020)
Keyphrases
</>
learning algorithm
reinforcement learning
inverse reinforcement learning
prior knowledge
reward function
random walk
robotic systems