Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition.
Peng ShenXugang LuHisashi KawaiPublished in: CoRR (2023)
Keyphrases
- speech recognition
- automatic speech recognition
- language model
- speech synthesis
- hidden markov models
- speech understanding
- speech signal
- noisy environments
- speaker identification
- speaker dependent
- speaker independent
- speech recognizer
- speaker diarization
- speech processing
- speech recognition systems
- speech recognition technology
- pattern recognition
- speech retrieval
- acoustic models
- speaker adaptation
- speech recognizers
- keyword spotting
- speaker recognition
- bayesian networks
- cepstral coefficients
- speech recognition errors
- feature extraction
- feature selection