Dive: End-to-End Speech Diarization Via Iterative Speaker Embedding.
Neil Zeghidour, Olivier Teboul, David Grangier
Published in: ASRU (2021)
Keyphrases
- end-to-end
- speaker diarization
- speaker identification
- speech recognition
- speaker recognition
- broadcast news
- audio visual
- speaker verification
- automatic speech recognition
- speaker dependent
- speech signal
- Gaussian mixture model
- wireless ad hoc networks
- noisy environments
- high bandwidth
- prosodic features
- congestion control
- multipath
- transport layer
- ad hoc networks
- Bayesian information criterion
- hidden Markov models
- content delivery
- admission control
- text localization and recognition
- acoustic features
- speech synthesis
- vocal tract
- application layer
- web services