Modeling Paragraph-Level Vision-Language Semantic Alignment for Multi-Modal Summarization.

Published in: CoRR (2022)

Keyphrases