Aligning Visual Regions and Textual Concepts: Learning Fine-Grained Image Representations for Image Captioning.

Published in: CoRR (2019)

Keyphrases