MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets.

Hossein Aboutalebi, Hwanjun Song, Yusheng Xie, Arshit Gupta, Lijia Sun, Hang Su, Igor Shalyminov, Nikolaos Pappas, Siffi Singh, Saab Mansour
Published in: NAACL-HLT (2024)
Keyphrases
  • multi modal
  • audio visual
  • multi modality
  • cross modal
  • image annotation
  • high level
  • information theoretic