CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation.

Published in: CoRR (2024)