Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis.
Desh RajPavel DenisovZhuo ChenHakan ErdoganZili HuangMaokui HeShinji WatanabeJun DuTakuya YoshiokaYi LuoNaoyuki KandaJinyu LiScott WisdomJohn R. HersheyPublished in: SLT (2021)
Keyphrases
- speaker diarization
- recognition rate
- statistical analysis
- audio visual
- automatic recognition
- structural analysis
- speech recognition
- document analysis
- automatic transcription
- recognition accuracy
- recognition engine
- automatic speech recognition systems
- video sequences
- recognition algorithm
- speaker verification
- spoken words