Vision-Guided Speaker Embedding Based Speech Separation.
Yuanjie Deng, Ying Wei
Published in: CISP-BMEI (2022)
Keyphrases
- vision guided
- speech recognition
- mobile robot navigation
- speaker recognition
- automatic speech recognition
- audio visual
- speaker verification
- speaker identification
- natural scenes
- speaker dependent
- prosodic features
- speaker diarization
- speech signal
- vocal tract
- automatic speech recognition systems
- gaussian mixture model
- mobile robot
- text to speech
- speech synthesis
- noisy environments
- synthesized speech
- speech sounds
- broadcast news
- multi modal
- acoustic features
- hidden markov models
- sound source
- natural images
- speaker adaptation
- visual speech
- unknown environments
- automatic transcription
- spontaneous speech
- human computer interaction
- visual information
- feature extraction