Login / Signup
Multi-head attention fusion networks for multi-modal speech emotion recognition.
Junfeng Zhang
Lining Xing
Zhen Tan
Hongsen Wang
Kesheng Wang
Published in:
Comput. Ind. Eng. (2022)
Keyphrases
</>
multi modal
speech emotion recognition
multi modality
fusing multiple
cross modal
single modality
audio visual
focus of attention
video search
high dimensional
video sequences
humanoid robot
semantic concepts