Speaker-invariant suprasegmental temporal features in normal and disguised speech.
Adrian Leemann, Marie-José Kolly
Published in: Speech Commun. (2015)
Keyphrases
- speech recognition
- speaker recognition
- audio visual
- automatic speech recognition
- speaker verification
- speaker identification
- prosodic features
- speech signal
- vocal tract
- speech synthesis
- speaker dependent
- automatic speech recognition systems
- speaker diarization
- noisy environments
- speaker adaptation
- multi modal
- phoneme recognition
- affine transformation
- speech recognizer
- speech sounds
- language model
- vector quantization
- acoustic features
- affine invariant
- hidden Markov models
- broadcast news
- Gaussian mixture model
- synthesized speech
- automatic transcription
- pattern recognition
- audio stream
- moment invariants
- visual information
- recognition engine
- acoustic models
- multimedia
- speech recognition systems
- spontaneous speech
- spectral features