Publication: Geometry Attention Transformer with Position-aware LSTMs for Image Captioning.