MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets.
Hossein Aboutalebi
Hwanjun Song
Yusheng Xie
Arshit Gupta
Lijia Sun
Hang Su
Igor Shalyminov
Nikolaos Pappas
Siffi Singh
Saab Mansour
Published in:
NAACL-HLT (2024)
Keyphrases
multi modal
audio visual
multi modality
cross modal
image annotation
high level
information theoretic