Filter dates
Overview
- indian languages
- noisy environments
- speech synthesis
- neural network
- prosodic features
Publications
A multi-modal lecture video indexing and retrieval framework with multi-scale residual attention network and multi-similarity computation.
Signal Image Video Process.
Hierarchical emotion recognition from speech using source, power spectral and prosodic features.
Multim. Tools Appl.
Transfer Accent Identification Learning for Enhancing Speech Emotion Recognition.
Circuits Syst. Signal Process.
Automatic classification of neurological voice disorders using wavelet scattering features.
Speech Commun.
ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face Swapping.
CoRR
Accent classification from an emotional speech in clean and noisy environments.
Multim. Tools Appl.