A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition.
Gnana Praveen RajasekharWheidima Carneiro de MeloNasib UllahHaseeb AslamOsama ZeeshanThéo DenormeMarco PedersoliAlessandro L. KoerichPatrick CardinalEric GrangerPublished in: CoRR (2022)