Visual speaker localization aided by acoustic models.
Gerald FriedlandChuohao YeoHayley HungPublished in: ACM Multimedia (2009)
Keyphrases
- acoustic models
- speech recognition
- automatic speech recognition
- hidden markov models
- speech recognizer
- speaker independent
- visual information
- speaker recognition
- speaker identification
- broadcast news
- multi modal
- discriminative training
- visual features
- low level
- speech signal
- audio visual
- visual data
- training process
- speaker diarization
- em algorithm
- computer vision