Zero Shot Audio To Audio Emotion Transfer With Speaker Disentanglement.
Soumya DuttaSriram GanapathyPublished in: ICASSP (2024)
Keyphrases
- audio visual
- emotion recognition
- multimedia
- audio stream
- speaker identification
- visual information
- visual data
- prosodic features
- speaker verification
- audio signals
- multimodal fusion
- digital audio
- audio files
- audio video
- audio features
- visual features
- multi modal
- signal processing
- information retrieval
- digital video
- feature space
- automatic transcription