Login / Signup

EEND-M2F: Masked-attention mask transformers for speaker diarization.

Marc HärkönenSamuel J. BroughtonLahiru Samarakoon
Published in: CoRR (2024)
Keyphrases
  • speaker diarization
  • speech recognition
  • audio stream
  • learning algorithm
  • feature extraction
  • signal to noise ratio
  • broadcast news