Three-Dimensional Speaker Localization: Audio-Refined Visual Scaling Factor Estimation.
Xinyuan QianQi LiuJiadong WangHaizhou LiPublished in: IEEE Signal Process. Lett. (2021)
Keyphrases
- scaling factors
- three dimensional
- visual information
- audio visual
- visual data
- visual speech
- speaker identification
- localization method
- visual features
- membership functions
- control parameters
- automatic transcription
- lattice vector quantization
- speech recognition
- approximation error
- speaker verification
- prosodic features
- feature extraction
- fuzzy logic controller
- speech signal
- optimization process
- estimation error