Versatile Audio-Visual Learning for Handling Single and Multi Modalities in Emotion Regression and Classification Tasks.
Lucas Goncalves
Seong-Gyun Leem
Wei-Cheng Lin
Berrak Sisman
Carlos Busso
Published in:
CoRR (2023)
Keyphrases
audio visual
multi modal
metadata
text classification
contextual information
temporal context