Publication: VD-SAN: Visual-Densely Semantic Attention Network for Image Caption Generation.