Complete-linkage clustering for voice activity detection in audio and visual speech.
Houman GhaemmaghamiDavid DeanShahram KalantariSridha SridharanClinton FookesPublished in: INTERSPEECH (2015)
Keyphrases
- voice activity detection
- visual speech
- noisy environments
- audio visual speech recognition
- visual speech recognition
- hidden markov models
- speaker identification
- k means
- speaker verification
- noise reduction
- clustering algorithm
- audio signals
- acoustic features
- speech recognition
- audio signal
- speech signal
- broadcast news
- video signals
- visual data
- unsupervised learning
- multi modal