CM-CS: Cross-Modal Common-Specific Feature Learning For Audio-Visual Video Parsing.

Hongbo Chen, Dongchen Zhu, Guanghui Zhang, Wenjun Shi, Xiaolin Zhang, Jiamao Li
Published in: ICASSP (2023)
Keyphrases
  • audio visual
  • cross modal
  • multi modal
  • visual data
  • feature selection
  • domain knowledge
  • natural language processing
  • video frames
  • visual recognition