Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis.
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey
Published in: CoRR (2020)
Keyphrases
- speaker diarization
- speech recognition
- statistical analysis
- recognition rate
- automatic speech recognition systems
- audio visual
- structural analysis
- speaker identification
- document analysis
- speaker dependent
- recognition engine
- speaker recognition
- neural network
- recognition accuracy
- speaker verification
- activity recognition
- automatic transcription
- feature extraction