Publication: Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment.