A combination of speaker normalization and speech rate normalization for automatic speech recognition.
Thilo Pfau, Robert Faltlhauser, Günther Ruske
Published in: INTERSPEECH (2000)
Keyphrases
- automatic speech recognition
- speech recognition
- speech signal
- word error rate
- hidden Markov models
- broadcast news
- conversational speech
- speech retrieval
- speech corpus
- recognition errors
- speech sounds
- spoken words
- speaker identification
- acoustic features
- spontaneous speech
- noisy environments
- neural network
- word recognition
- speaker adaptation
- vocal tract
- acoustic models
- speech recognition systems
- speech recognizer
- speaker dependent
- linear prediction
- non-stationary
- speech segments
- phoneme recognition
- pattern recognition
- computer vision