Publication: Deep Transformer Q-Networks for Partially Observable Reinforcement Learning.