Login / Signup

Dynamic Adjustment of Reward Function for Proximal Policy Optimization with Imitation Learning: Application to Automated Parking Systems.

Mohamad AlbilaniAmel Bouzeghoub
Published in: IV (2022)
Keyphrases
  • reward function
  • inverse reinforcement learning
  • optimal policy
  • imitation learning
  • state space
  • complex systems
  • dynamic environments
  • markov decision processes
  • reinforcement learning algorithms