Login / Signup

Multi-head attention fusion networks for multi-modal speech emotion recognition.

Junfeng ZhangLining XingZhen TanHongsen WangKesheng Wang
Published in: Comput. Ind. Eng. (2022)
Keyphrases
  • multi modal
  • speech emotion recognition
  • multi modality
  • fusing multiple
  • cross modal
  • single modality
  • audio visual
  • focus of attention
  • video search
  • high dimensional
  • video sequences
  • humanoid robot
  • semantic concepts