One model to rule them all? Towards End-to-End Joint Speaker Diarization and Speech Recognition.
Samuele Cornell
Jee-weon Jung
Shinji Watanabe
Stefano Squartini
Published in:
CoRR (2023)
Keyphrases
end to end
speech recognition
speaker diarization
pattern recognition
language model
neural network
computer vision
multi modal
speaker identification
speech recognizer