Joint speaker diarization and speech recognition based on region proposal networks.
Zili HuangMarc DelcroixLeibny Paola García-PereraShinji WatanabeDesh RajSanjeev KhudanpurPublished in: Comput. Speech Lang. (2022)
Keyphrases
- speaker diarization
- speech recognition
- speaker identification
- automatic speech recognition
- language model
- hidden markov models
- speech signal
- pattern recognition
- noisy environments
- speech recognizer
- speech synthesis
- speech recognition systems
- handwriting recognition
- bayesian information criterion
- broadcast news
- computer vision
- natural language processing
- probabilistic model