Login / Signup
REFINESUMM: Self-Refining MLLM for Generating a Multimodal Summarization Dataset.
Vaidehi Patil
Leonardo F. R. Ribeiro
Mengwen Liu
Mohit Bansal
Markus Dreyer
Published in:
ACL (1) (2024)
Keyphrases
</>
multi modal
neural network
information retrieval
synthetic datasets
multi document summarization
generation process
genetic algorithm
decision trees
benchmark datasets
audio visual
training dataset
video summarization
automatically generating
multimodal interaction
music retrieval