Login / Signup
EEND-M2F: Masked-attention mask transformers for speaker diarization.
Marc Härkönen
Samuel J. Broughton
Lahiru Samarakoon
Published in:
CoRR (2024)
Keyphrases
</>
speaker diarization
speech recognition
audio stream
learning algorithm
feature extraction
signal to noise ratio
broadcast news