Sign in

One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition.

Samuele CornellJee-weon JungShinji WatanabeStefano Squartini
Published in: CoRR (2023)
Keyphrases
  • end to end
  • speech recognition
  • speaker diarization
  • pattern recognition
  • language model
  • neural network
  • computer vision
  • multi modal
  • speaker identification
  • speech recognizer