Binaural localization of speech sources in 3-D using a composite feature vector of the HRTF.
Xiang WuDumidu S. TalagalaWen ZhangThushara D. AbhayapalaPublished in: ICASSP (2015)
Keyphrases
- feature vectors
- speech recognition
- speech signal
- feature space
- feature extraction
- euclidean distance
- information sources
- gabor filters
- object localization
- image features
- similarity measure
- speech segments
- feature set
- data sources
- texture features
- databases
- support vector machine
- speech synthesis
- multiple sources
- localization algorithm
- transfer function
- speaker identification
- color and texture information
- machine learning
- automatic speech recognition
- gaussian mixture model
- image classification
- noisy environments
- broadcast news
- speaker recognition
- text to speech
- audio visual
- multi lingual
- optic disc
- localization method
- localization error
- mel frequency cepstral coefficients
- multimedia