Login / Signup
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets.
Hossein Aboutalebi
Hwanjun Song
Yusheng Xie
Arshit Gupta
Justin Sun
Hang Su
Igor Shalyminov
Nikolaos Pappas
Siffi Singh
Saab Mansour
Published in:
CoRR (2024)
Keyphrases
</>
multi modal
multi modality
cross modal
audio visual
image segmentation
semantic concepts
machine learning
fully automatic
video search