Varying Microphone Patterns for Meeting Speech Segmentation Using Spatial Audio Cues.
Eva ChengIan S. BurnettChristian H. RitzPublished in: PCM (2006)
Keyphrases
- speaker diarization
- audio stream
- motion cues
- audio visual
- prosodic features
- broadcast news
- automatic speech recognition
- bayesian information criterion
- visual information
- text to speech
- meeting room
- spatial patterns
- speech recognition
- speaker identification
- spatial information
- multimedia
- speaker verification
- segmentation algorithm
- segmentation method
- video segmentation
- figure ground
- image segmentation
- shape prior
- multiscale
- spatial constraints
- spatial data
- appearance cues
- speech processing
- visual cues
- spatial features
- noisy environments
- digital audio
- audio recordings
- level set
- spatial context
- audio signals
- spatio temporal
- speech synthesis
- audio features
- linear predictive coding
- cepstral features
- multiple cues
- speech signal
- spatial and temporal
- medical images
- acoustic features
- audio video
- object segmentation
- signal processing
- fundamental problems in computer vision
- multi stream
- multi modal
- acoustic signals
- multimodal fusion
- hidden markov models