Login / Signup
Dynamic Adjustment of Reward Function for Proximal Policy Optimization with Imitation Learning: Application to Automated Parking Systems.
Mohamad Albilani
Amel Bouzeghoub
Published in:
IV (2022)
Keyphrases
</>
reward function
inverse reinforcement learning
optimal policy
imitation learning
state space
complex systems
dynamic environments
markov decision processes
reinforcement learning algorithms