Publication: Improving Visual Storytelling with Multimodal Large Language Models.