Deep Multimodal Speaker Naming.
Yongtao Hu, Jimmy S. J. Ren, Jingwen Dai, Chang Yuan, Li Xu, Wenping Wang
Published in: CoRR (2015)
Keyphrases
- audio visual
- speaker verification
- multi modal
- speech recognition
- multimodal interaction
- multimodal data
- deep learning
- brain image analysis
- speaker diarization
- multi party
- visual information
- gaussian mixture model
- vector quantization
- image sequences
- multimedia
- speaker identification
- computer vision
- artificial intelligence
- machine learning
- data sets
- visual speech
- prosodic features
- automatic transcription
- database