Learning to Answer Questions in Dynamic Audio-Visual Scenarios.

Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu
Published in: CoRR (2022)
Keyphrases
  • audio visual
  • answer questions
  • domain knowledge
  • feature extraction
  • high dimensional
  • multi modal