NTT Speaker Diarization System for Chime-7: Multi-Domain, Multi-Microphone end-to-end and Vector Clustering Diarization.
Naohiro TawaraMarc DelcroixAtsushi AndoAtsunori OgawaPublished in: ICASSP (2024)
Keyphrases
- speaker diarization
- end to end
- multi domain
- bayesian information criterion
- speech recognition
- cross domain
- k means
- congestion control
- model selection
- broadcast news
- clustering algorithm
- domain specific
- gaussian mixture model
- feature vectors
- mixture model
- feature selection
- heterogeneous networks
- error rate
- data points