Example-guided learning of stochastic human driving policies using deep reinforcement learning.
Ran EmunaRotem DuffneyAvinoam BorowskyArmin BiessPublished in: Neural Comput. Appl. (2023)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- optimal policy
- learning systems
- supervised learning
- control policies
- active learning
- multi agent
- model free reinforcement learning
- learning environment
- deep architectures
- hierarchical reinforcement learning
- autonomous learning
- continuous state
- human learning
- learning automata
- markov decision process
- action selection
- machine learning
- state space
- dynamic programming