Synthetic F0 Can Effectively Convey Speaker ID in Delexicalized Speech.
Eric MorleyEsther KlabbersJan P. H. van SantenAlexander KainSeyed Hamidreza MohammadiPublished in: INTERSPEECH (2012)
Keyphrases
- speech recognition
- speaker recognition
- audio visual
- automatic speech recognition
- speaker verification
- speaker identification
- speech signal
- speaker diarization
- real world
- vocal tract
- automatic speech recognition systems
- prosodic features
- speaker dependent
- speech synthesis
- spoken language
- spontaneous speech
- acoustic features
- text to speech
- multi modal
- visual information
- audio signals
- hidden markov models
- pattern recognition
- information retrieval
- real images are presented
- data sets
- noisy environments
- automatic transcription
- real time