Perceptual annotation of expressive speech.
Lijuan Wang, Min Chu, Yaya Peng, Yong Zhao, Frank K. Soong
Published in: SSW (2007)
Keyphrases
- speech recognition
- semantic annotation
- speech signal
- active learning
- low-level
- text-to-speech
- audio-visual
- metadata
- automatic annotation
- automatic speech recognition
- manual annotation
- perceptual organization
- automatic image annotation
- perceptual grouping
- human perception
- recognition engine
- speech synthesis
- language acquisition
- human visual system
- multi-stream
- endpoint detection
- speech processing
- weakly labeled
- speaker recognition
- broadcast news
- human vision
- dialogue system
- spoken dialogue systems
- video annotation
- noisy environments
- spontaneous speech
- video retrieval
- image retrieval
- learning algorithm