Login / Signup

One Model to Rule Them All ? Towards End-to-End Joint Speaker Diarization and Speech Recognition.

Samuele CornellJee-Weon JungShinji WatanabeStefano Squartini
Published in: ICASSP (2024)
Keyphrases
  • end to end
  • speech recognition
  • speaker diarization
  • pattern recognition
  • language model
  • speaker identification
  • video sequences