Login / Signup

HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition.

Licai SunZheng LianBin LiuJianhua Tao
Published in: Inf. Fusion (2024)
Keyphrases
  • audio visual
  • emotion recognition
  • multi modal
  • visual information
  • speaker verification
  • visual data
  • multimedia
  • data sets
  • information extraction
  • image database