Sign in

Cross-Modal Learning for Audio-Visual Video Parsing.

Jatin Lamba AbhishekJayaprakash AkulaRishabh DabralPreethi JyothiGanesh Ramakrishnan
Published in: Interspeech (2021)
Keyphrases
  • audio visual
  • multi modal
  • cross modal
  • visual data
  • visual recognition
  • video data
  • computer vision
  • multimedia
  • high level
  • training set
  • low level
  • space time
  • key frames
  • video analysis