A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition.
R. Gnana PraveenWheidima Carneiro de MeloNasib UllahHaseeb AslamOsama ZeeshanThéo DenormeMarco PedersoliAlessandro L. KoerichSimon BaconPatrick CardinalEric GrangerPublished in: CVPR Workshops (2022)