Publication: Deep reinforcement learning collision avoidance using policy gradient optimisation and Q-learning.