Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation.
Rongzhi GuJunyi PengYuexian ZouDong YuPublished in: APSIPA (2019)
Keyphrases
- speech recognition
- audio visual
- speaker verification
- automatic speech recognition
- speaker recognition
- speech signal
- speaker identification
- vocal tract
- automatic speech recognition systems
- prosodic features
- speaker diarization
- information hiding
- noisy environments
- higher level
- information retrieval
- emotion recognition
- speech synthesis
- hidden markov models
- speaker dependent
- speech sounds