Audio-Video Speaker Diarization for Unsupervised Speaker and Face Model Creation.
Pavel CamprMarie KunesováJan VanekJan CechJosef PsutkaPublished in: TSD (2014)
Keyphrases
- speaker diarization
- audio video
- face model
- human faces
- speech recognition
- keypoints
- facial features
- multimedia
- facial expressions
- face images
- feature points
- active appearance models
- unsupervised learning
- semi supervised
- speaker verification
- broadcast news
- speaker identification
- bayesian information criterion
- online video
- pose variations
- service management
- shape model
- hidden markov models
- pattern recognition