Login / Signup

Multimodal Network with Cross-Modal Attention for Audio-Visual Event Localization.

Qianchao Tan
Published in: HCMA@MM (2022)
Keyphrases
  • audio visual
  • cross modal
  • multi modal
  • visual data
  • visual information
  • multi stream
  • visual features
  • multimedia
  • dimensionality reduction
  • video data
  • multimedia retrieval
  • visual similarity