Using audio-visual features for robust voice activity detection in clean and noisy speech.
Ibrahim Almajai, Ben P. Milner
Published in: EUSIPCO (2008)
Keyphrases
- visual features
- voice activity detection
- noisy environments
- noisy speech
- visual information
- speech recognition
- speech enhancement
- image classification
- visual content
- visual data
- image retrieval
- background noise
- low level
- keywords
- noise reduction
- speech signal
- speaker verification
- audio features
- speaker identification
- image search
- low level features
- hidden markov models
- automatic speech recognition
- acoustic features
- video shots
- key frames
- multimedia
- language model
- denoising
- high level