Fusion of spectral and prosody modelling for multilingual speech emotion conversion.
Susmitha VekkotDeepa GuptaPublished in: Knowl. Based Syst. (2022)
Keyphrases
- text to speech
- speech synthesis
- text to speech synthesis
- audio visual
- multimodal fusion
- emotion recognition
- speech recognition
- prosodic features
- multi lingual
- spectral features
- linear predictive coding
- information fusion
- emotional state
- multi stream
- emotional speech
- synthesized speech
- fusion method
- multimodal interfaces
- digital libraries
- linear prediction
- spectral analysis
- hyperspectral imagery
- data fusion
- cross lingual
- human computer interaction
- automatic speech recognition
- cross language
- multi sensor
- vocal tract
- multi spectral images
- speaker verification
- multi modal
- speaker adaptation
- spectral images
- information retrieval
- image fusion
- recognition engine
- hyperspectral images
- affective states
- facial expressions
- pattern recognition