Publication: GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features.