A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification.

Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee
Published in: CoRR (2022)
Keyphrases
  • audio visual
  • data sets
  • visual data
  • data analysis
  • multi modal
  • scene classification
  • data points
  • image data
  • computer vision
  • information extraction
  • image classification
  • high dimensional data