Login / Signup

A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition.

R. Gnana PraveenWheidima Carneiro de MeloNasib UllahHaseeb AslamOsama ZeeshanThéo DenormeMarco PedersoliAlessandro L. KoerichSimon BaconPatrick CardinalEric Granger
Published in: CVPR Workshops (2022)
Keyphrases
  • audio visual
  • emotion recognition
  • multi modal
  • feature selection
  • machine learning
  • three dimensional
  • high level
  • image data
  • input data