Joint Autoregressive Modeling of End-to-End Multi-Talker Overlapped Speech Recognition and Utterance-level Timestamp Prediction.

Published in: INTERSPEECH (2023)

Keyphrases