Multimodal speaker identification with audio-video processing.
Yücel Yemez, Alper Kanak, Engin Erzin, A. Murat Tekalp
Published in: ICIP (3) (2003)
Keyphrases
- speaker identification
- video processing
- speech recognition
- video analysis
- gaussian mixture model
- broadcast news
- video segmentation
- video compression
- signal processing
- speech signal
- feature extraction
- video surveillance
- noisy environments
- image processing
- audio visual
- audio features
- real time
- multi modal
- video data
- hidden markov models
- computer vision
- pattern recognition
- video retrieval
- multimedia
- video coding
- neural network