Sign in

Cross-modal distillation with audio-text fusion for fine-grained emotion classification using BERT and Wav2vec 2.0.

Donghwa KimPilsung Kang
Published in: Neurocomputing (2022)
Keyphrases