Speaker localization and tracking in the presence of sound interference by exploiting speech harmonicity.
Kai WuShu Ting GohAndy W. H. KhongPublished in: ICASSP (2013)
Keyphrases
- speech recognition
- automatic speech recognition systems
- audio visual
- speaker recognition
- automatic speech recognition
- acoustic features
- speaker verification
- speaker identification
- accurate localization
- localization algorithm
- speech signal
- object tracking
- particle filter
- vocal tract
- prosodic features
- speaker diarization
- sound source
- real time
- position estimation
- kalman filter
- reliable detection
- speech synthesis
- multi modal
- broadcast news
- mobile robot localization
- automatic transcription
- visual tracking
- speaker dependent
- audio stream
- speech sounds
- position information
- audio features
- pattern recognition
- text to speech
- particle filtering
- mean shift
- spoken language
- speech recognizer
- multi stream
- object localization
- moving target
- appearance model
- gaussian mixture model
- language model
- optical flow
- feature extraction