Effect of speed difference between time-expanded speech and moving image of talker's face on word intelligibility.
Shuichi Sakamoto, Akihiro Tanaka, Komi Tsumura, Yôiti Suzuki
Published in: J. Multimodal User Interfaces (2008)
Keyphrases
- input image
- image data
- speech recognition
- image analysis
- image classification
- image features
- image representation
- image segmentation
- lighting conditions
- multiscale
- single image
- n-gram
- image content
- keypoints
- low level
- feature points
- real time
- co-occurrence
- adjacent pixels
- image retrieval
- face recognition
- image set
- image collections
- illumination conditions
- recognition engine
- real world objects
- camera movement
- spontaneous speech
- English text
- integral image
- pixel values
- segmentation method
- edge detection