SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings.

Published in: ICIIT (2023)

Keyphrases