A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification.
Qing WangJun DuSiyuan ZhengYunqing LiYajian WangYuzhong WuHu HuChao-Han Huck YangSabato Marco SiniscalchiYannan WangChin-Hui LeePublished in: ISCSLP (2022)