Publication: GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features.