Login / Signup

Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition.

Akihiko TakashimaRyo MasumuraAtsushi AndoYoshihiro YamazakiMihiro UchidaShota Orihashi
Published in: INTERSPEECH (2022)
Keyphrases
  • audio visual
  • emotion recognition
  • multi modal
  • cross modal
  • visual data
  • multi stream
  • visual information
  • facial expressions
  • machine learning
  • multimedia
  • xml documents
  • low level