Voice Conversion Using Speech-to-Speech Neuro-Style Transfer.
Ehab A. AlBadawy, Siwei Lyu
Published in: INTERSPEECH (2020)
Keyphrases
- text-to-speech
- speech recognition
- speech synthesis
- speech signal
- speech quality
- audio-visual
- voice activity detection
- speech recognition errors
- dialogue system
- emotion recognition
- spoken language
- endpoint detection
- fundamental frequency
- information systems
- neural network
- automatic speech recognition
- speaker recognition
- multimodal interfaces
- multi stream
- vocal tract
- case study
- speech sounds
- audio stream
- learning algorithm
- prosodic features
- recognition engine
- human-computer interaction
- speech processing
- artificial neural networks
- multi modal