Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey.

Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian, Wen Gao
Published in: Mach. Intell. Res. (2023)
Keyphrases
  • multi modal
  • multi modality
  • audio visual
  • cross modal
  • pre trained
  • neural network
  • video sequences
  • small number
  • multiple modalities