Login / Signup

AttA-NET: Attention Aggregation Network for Audio-Visual Emotion Recognition.

Ruijia FanHong LiuYidi LiPeini GuoGuoquan WangTi Wang
Published in: ICASSP (2024)
Keyphrases
  • audio visual
  • emotion recognition
  • multi modal
  • visual information
  • speaker verification
  • visual data
  • multimedia
  • multi stream
  • human computer interaction
  • data mining
  • information retrieval
  • computer vision
  • context aware