Sign in

Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis.

Hengshun ZhouJun DuGongzhen ZouZhaoxu NianChin-Hui LeeSabato Marco SiniscalchiShinji WatanabeOdette ScharenborgJingdong ChenShifu XiongJianqing Gao
Published in: INTERSPEECH (2022)
Keyphrases
  • audio visual
  • multi modal
  • word spotting
  • multimedia
  • multi stream
  • feature selection
  • visual information