• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Multi-head attention fusion networks for multi-modal speech emotion recognition.

Junfeng ZhangLining XingZhen TanHongsen WangKesheng Wang
Published in: Comput. Ind. Eng. (2022)
Keyphrases
  • multi modal
  • speech emotion recognition
  • multi modality
  • fusing multiple
  • cross modal
  • single modality
  • audio visual
  • focus of attention
  • video search
  • high dimensional
  • video sequences
  • humanoid robot
  • semantic concepts