Audio-visual word prominence detection from clean and noisy speech.
Martin HeckmannPublished in: Comput. Speech Lang. (2018)
Keyphrases
- audio visual
- noisy speech
- multi modal
- hidden markov models
- visual information
- speech recognition
- noisy environments
- multi stream
- speech signal
- background noise
- visual data
- audio visual speech recognition
- multimedia
- emotion recognition
- information retrieval
- speech enhancement
- low level
- pattern recognition
- multiscale
- speaker verification
- machine learning
- co occurrence
- signal to noise ratio
- keywords
- neural network